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SOME DISTRIBUTIONS OF SAMPLE MEANS 

George W. Brown and John W. Tukey 
ECA Laboraioncs and Princeton University 

1. Summary. It is shown that certain monomials in normally distributed 
quantities have stable distributions with index 2“*. This provides, for fc > 1, 
simple examples where the mean of a sample has a distribution equivalent to 
that of a fixed, arbitrarily large multiple of a single observation. These examples 
include distributions symmetrical about zero, and positive distnbutions. 

Using these examples, it is shown that any distribution with a very long tail 
(of average order > has the distributions of its sample means grow flatter 
and flatter as the sample size increases. Thus the sample mean provides less 
information than a smgle value. Stronger results are proved for still longer 
tails. 

2 Introduction, This paper derives and exploits certain elementary ex¬ 
pressions for stable distributions. The practicing statistician may be inter¬ 
ested in Uie general di.scussioii of results, going as far as Section 5. The reader 
interested in probability theory may be interested in 

(i) the simple monomials in normally distributed quantities which are 
shown to be stable (Section 7) 

(ii) the resulting bounds on the densities of these stable distributions 
(Section 8) 

(iii) Theorem A, which forms a partial converse to the Central Limit 
Theorem. 

It should be pointf’d out that examples of stable chance quantities arising from 
infinite series (Khintchine 1937, [2], [3)) Bird integrals (I.«vy 1935, [4]) are already 
known. These results form a natural part of broader investigations into 

(i) the relative value of the mean, the median, and their competitors 

(ii) the properties and distributions of simple functions of normally dis¬ 
tributed quantities. 

3, Stable distributions. One of the typical properties of the normal dis¬ 
tribution with zero mean is that the distribution of the mean of a sample of n 
has the same, shape but is compreissed by the factor Vn. The Cauchy dis¬ 
tribution is well-known for the property that the mean of a sample of n has 
the same distribution os a single observation. 

Statisticians have not widely appreciated the fact that them are symmetric, 
smooth distributions for every positive X ^ 2, with the property that the dis¬ 
tribution of the mean of a sample of n has the same shape as the original dis¬ 
tribution but is spread out in the ratio These are the symmetric stable 

distributions of index X. 

It is interesting to note that if X = .001, then the mean of a sample of two 
is times as variable as the mean of a sample of one For small X the means 
become unduly variable ivith a rapidity which is difficult to comprehend. 

1 
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4, Outline of results. Section 7 is devoted to the proof that certaitr mono¬ 
mials in normal variables are stable of index 2"* for integnil k. Both wymmetri- 
cal and positive cases are shown to exist. For fc = 0, the bymmetrical raw is 
the familiar Cauchy distribution, which is the distribution of Student’s “i” 
no one degree of freedom, while the positive case for A: » 1 is tht> flietribiition 
of Snedecor’s “F” on « and 1 degrees of freedom. 

In Section 8 it is shown that the symmetrical stable distribution of index \ 
has a density which is 

(i) bounded by a constant 

(ii) bounded by a constant times | x for the value.s X ™ hi h i, 
• ■ , for which elementary examples are available. It i.s conjeriurc'd that 
this is true for all X < 2. 

In section 9 it is shown that, if a distribution has one long tail in tho wnsn that 

(11) lira 1 a: 1^+^ ?(« < X < a: + h] > 0, 

for some h and one of the above values of X (the bra may I«* taken either as 
s —> + CO or as a: -> —«>), then the distribution of the sum of a sarupU* of n 
spreads out as fast as for a stable distribution with the same* value of X. This 
may be restated for the mean as follows: 

(i) A distribution has a long tail of order | x if (i.n Iwhh for wm 
h > 0 and choice of sign for x, 

(ii) If the distribution has a density f(x), then (1.1) ia a ctiimqwticr of 

(12) f(x) > j-;pj^jrFX. > 0. 

(iii) The distribution of the mean of a sample of n wiU be said to spread out 
as fast asn , if the distance between any two percentage points for the mean of a 
sample of n is uUimalely larger than a fixed multiple of ri . 

(iv) Theorem A. If the distribution of X has at least one long tail of order 

I a: [ , where X = 1, :J, • • • , then the dislribulion of the mean of a sample 

of n values of X spreads out as fast as 

Section 10 presents a simple example of a distribution symmetrie about aero 
with such long tails that 

(i) the distribution of the sample mean spreads out faster than any power 
of n, 

(li) the median of a sample of any aiise fails to have finite momeuls of 
positive order, integral or fractional. 

5. Consequences for applied statistics. The basic consequences of these 
results for applied statistics can be summarized in the following statements, 

(a) The positions that the Cauchy distribution is an isolated ease, or else 
an extreme example of pathology, are now untenable. 
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(b) The use of the mean of a sample as a measure of location (or, when 
dealing with positive distributions 6xed at zero, as a measure of scale) im¬ 
plies a belief that the tails of the underlying distribution are not too long. 

(c) It is probable that the relative efficiencies of mean and median are 
greatly affected by the length of the taU, 

The importance of this last statement lies in the fact that direct empirical 
evidence about tail length is very hard to obtain. The mean is well known 
to be more efficient when the underlymg distribution is normal. Normality of 
the tails of practical distributions is rarely based on firm empirical evidence. 
In these practical cases, greater efficiency of the mean should often not be 
assumed without empirical confirmation. 

It may be argued that the results of this paper apply to the limit as n —> “ 
and to the behavior of the distribution near infinity, while the practical problems 
involve moderate values of n and the behavior of the distribution near its 5%, 
1%, 0.1%, 95%, 99%, and 99.9% points. This is undoubtedly true, but the 
authors believe, and have some evidence to confirm, the following correspon¬ 
dence principle: 

If certain mathematical tails imply certain asymptotic behavior, then 
similar practical tails imply similar behavior in moderate samples. 

Here “mathematical tails” refers to behavior at infinity while practical tails 
run from the 5% to the 0.1% point and from the 96% to the 99.9% point. 

It is of some interest to point out that Snedecor’s “F” provides applications 
of Theorem A. If U values of F are averaged, where each was obtained on,?ii 
and nj degrees of freedom, then as N increases 

(i) if nj > 2, the average converges to 1 (i.c; all percent points converge 
to 1), by the Central Limit Theorem 

(ii) if n 2 = 2, the percent points of the average stay a finite distance away 
from each other, by Theorem A 

(iii) if rii = 1, the percent points of the average separate frorp each 
other at least as fast as a constant times y/N, by Theorem A, 

The consequences of Theorem A follow from the asymptotic density of F, 
which is a constant times 

6. Notation and terminology. Chance quantities (random varialiles) 
will be denoted by capitals and their values by lower case letters. The same 
letter will generally be used, so that x will frequently be a value of X. 

The letter S, with or without indices, represents a standard dc'viate (nor¬ 
mally distributed quantity with zero mean and unit variance). tJnlo.s.s other 
wise specified all sets of chance quantities will be assumed to be independent. 

Cumulative distribution functions will be referred to simply as “cumulativea” 
and will be denoted by capitals. Probability density functions will be referred 
to as “densities” and will be denoted by the corresponding lower ease letters. 

The convolution of tAvo cumulatives F and G Avill be denoted by F*G. It is 
the cumulative of sums of tAvo independent values, one from each distribution. 
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7 Special stable distributions. Cauchy (1853, [1]) recognized that dis¬ 
tributions with characteristic functions of the form 

e 

were stable. A distribution is stable if whenever k and I are positive and A 
and B are independent chance quantities distributed acconling to the aame 
law, then kA + IB is distributed like a fixed multiple of A. It ia known (L6vy 
1937, [5]i PP- 94 ff') that any stable distribution has a characteristic function 
of the form 

e > 

where 0 < X < 2, a > 0, and 1 d i < I «tan JttX |. Eacli stable distribution 
thus has an index \ such that kA + IB and have the same dis¬ 

tribution when A and B are a sample of two from the given distribution. 

This section exhibits, for every integral k, simple monomials of atamlard 
deviates which have stable distributions of mdex 2'“*'. 

(7.1) Theorem; Let S, Sa, Si, 8%, • • • be a sequence of independent standard 
deviates. Then 

(i) Co « S/So and Po = 1 
are stable of index 1 = 2~“. 

(li) Cl = S/SoSl = Co/ 5’2 and Pi - 1/5? = Po/5? 
are stable of index ^ = 2“b 
(lii) C, « 5/SoS?S?’ = Ci/Sf 
and Pj = 1/S?5r = Pi/Si 
are stable of index i = 2~’. 

(iv) tn general, C* = Ck-i/Sf and Pk ~ Pk-i/Sl" 
are stable of index 2~^. 

The C* are a sequence of symmetrically distributed chance quantities which 
are here presented as monomials in normally distributed chance quantities and 
whose stability properties imply for lb > 1 that the distributions of weans of 
samples spread out as the sample size increases. The Pi, are a similar sequence, 
all of whose values are positive. 

The stability properties of the Ck follow, directly, by means of elementary 
composition properties of characteritic functions, from 

(7.2) Lemma: The characteristic function of Ck is 

£!(e“®*) = exp(—2 | I*"*). 

Proof; The case Ic = 0 is the familiar Cauchy distribution. DenutinK the 
normal cumulative by N{s), it is seen that 

dNis) dNis,) 

= exp (-^t’/s?) dA(so) 
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The second definite integral is well known (e.g. Formula 495 in B. 0. Pierce’s 
table). Assuming the result for fc—1, write 

= r r exp (ilCfc-i/sf) dF*-i(Ci_i) dNis,) 

J —00 *^00 

= I] exp (-2U«r*'^Vs?)dA(s,) 

= exp (-2 I 

precisely as in the derivation for A; = 0 . 

The stability properties of the P* follow, by completely analogous use of 
the moment generating function, from 
(7.3) Lemma; The moment generating function of P* is 

= exp(-2(iZ)^‘^), t > 0. 

Proof; The trivial case fc = 0 is verified directly, since Po s 1. The induction 
from k—1 to k is identical ivith the derivation of (7.2), as is seen by writing 

= [ r (-tPk-x/sT) dt?w(P.) dN{s,) 

<^00 V Q 

= r exp (-2(itr'''"/sl) dNis,) 

J —00 

= exp . 

In order to verify the stability properties, consider distributions with char¬ 
acteristic functions of the form exp{—d 1 1 ]’'). If A and B are independently 
distributed according to this distribution, then 

_ P(e'“'^)P(e'‘"‘®) = 

for I, m > Q. Parallel application of the moment generating function yields 
piecisely analogous results. , 

8 . Some auxiliary results. It is the purpose of this section to establish 
some results concerning stable distributions. It will be convenient to state 
and prove some of these lemmas in general form. 

(8.1) Lemma: If X has a density f{x) satisfying 

m <A\x r“, 

then X has finite negative moments of orders down to —(1 — a ), 

Proof: If — (I — a) < /9 < 0, then 

I * \^fix) ^ A 1 X 1-“+^ 

with —a+P > — 1. Now 

f |xf/(x)(ix< f f(x) dx + f |xl^/(x)cix+ f f(x) dx 

< f(x) dx -f jf A 1X dx < CO, 
which proves the lemma. 
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(8.2) Lemma: If X has a density fix) satisfying 

fix) <A\x\'‘' 

and if Y has a density giy) and a finik negative moment of order -il-a), Him 
the density hix) of XY satisfies 

hix) < ill 1 a: r”* 

Prooe: The density hix) satisfies 

hix) = r \fix/t)git)/\t]]dt 

< rA\tr\xrgit)\trut 

J—OO 

={£ i 11 r"““' git) ] X r «ill! X |-. 

(8.3) Lemma; The density hiy) of 

SiSdW' ••• iSk)'', 

where IS, Si, Si, <S* are independent standard dmaies, satisfies 

hiy)<A\yr^^\ 

and hence Yt, has finite negative moments of all orders down to —2““*. 

Proof: Let 3 i(x) be the density of 

X, = iS,)^\ 

then 

gi,ix) ~ (27r)'*2~'’exp(— 

whence 

9kix) < ill 1X 

For A: = 0 this is the desired result; the other cases follow by induction, using 
y*. « XkYi^i and lemma (8,2). The final statement of the lemma then follows 
from lemma (8.1), 

(8.4) Theorem: For X = 2“*’, the density m\ix) of Gh salisfiee 

(*) m^(x) ^A\x « A I X 

and also 

(**) mx(x) < Aj. 

Proof; By definition, C* = S/y*, By lemma (8.3) the density of Yk satisfies 

My) < Ailyr‘+^'*. 
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The density of l/F* satisfies 


m = I h,(x/z) 

z 


Since S has a finite moment of order 2“*", it follows from lemma (8,2) that the 
density of jS/F* satisfies the desired relation (*). Since S has finite moments 
of all positive orders, so does and therefore Ft. Thus 1/Ft has moments 
of all negative orders, including —1. Since the density of S is bounded, lemma 
(8,2) implies the same for S/Yk and hence for Ca, . This completes the proof 
of the theorem, 

9. Distributions with a long tail. The purpose of this section is to prove 
(9,1) Theoxiem; // D has a cumulaiive F{x) such that for some h > 0, eilher 


lim 


F(x + h) - F(x) 


-d+X) 


^ n lim + h) - Fix) ^ ^ 

> 0, or hm , > 0, 

I-* —80 C 


where X = 2 /or A) = 0, 1, 2, • • • , and if An(a) is the a-point (100a percent point) 
of the dislribution of sums of n independent values of D, Ihen 


11 m (a^l) *“ (^^ 2 ) 

— ^ ’ 


whenever ai > aj. 

We begin with some lemmas. 

(9,2) Lemma; If 

Fix) = ^F'ix) + (1 - p)F"ix), I 

0 < i3 < 1 

<?(a;) = ^F'(*) + (1 - ^)1(x), J 

where F'ix) is a cumulative symmetrib about zero and unimodal, F''ix) vs a cumula¬ 
tive symmetric about zero, and 1 (x) is the cumulative concentrated at zero (whence 
Fix) and Gix) are cumulalives), and if F^ix) and Gnix) are the cumulatives of 
sums of samples of n from Fix) and Gix) respectively, then 

F„ix) < 0„ix), X > 0, 

Fnix) > Gnix), X < 0. 

Proof : We begin with the case n = 2, where 

Fs = ^*F'*F' + 2,3(1 - p)F'*F" + (1 - 0fF"*r' 

and 

Gi = ^V*F' + 2^(1 - ft)F' + (1 - ,3)'1. 

The lemma will have been proved for n = 2 if we can show that 

F'*F"ix) < F'ix), a: > 0, 

F'*F"ix) > F'ix), x<0. 
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Now, if a: > 0, 

F'*F"{x) ~ f - s) dF"is) 

tjLgO 

= f lF'(x - 8) + F(a: H- 8)1 clF''(s) 

Jo 

< 2 r F'(x)dF"(s) ^ F'ix), 

Jo 

where the first equality follows from the symmetry of F\ the inequality follows 
from the unimodality of F', and the last equality follows from the symmetry 
of F". The inequality is reversed if x < 0. 

For general n, 





where Fi (the convolution of k copies of F') is the cumulative for Hums of k 
independent values from F', and F* is similarly related to F", Since is 
unimodal and symmetric and since P'Lk is symmetric, the same argument can 
be applied term by term to complete the proof of the lemma. The requirenmnt 
that F" be s 3 mmebric could be replaced by the formally weaker condition that 
Fi(0) = J for all fc. 

(9.3) Lemma; If 

F{x) = dF(X)(x) + (1 - /3)l(a:), 0 < <9 < 1, 

where Fn)ix) is ihe cumulaUve of Ck, with X « 2~*, and if K„(a) is as defined in 
(9.1), ihm 

lim 

where K(i,)(a) is the a-poinl for F,),)(x). 

Proof: Let and F(>)„ be the cumulatives of sums of n from P and re¬ 
spectively, whence 

Fauix) s= F(X)(n‘"‘x). 


" £(t)9‘a - 


Then 
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The characteristic function of (n^y'^x is 

where the characteristic function associated with F(X)(a:) is eip(—d | < |’'). Thus 
we have to deal with 



where h has a binomial distribution with mean n/J and variance nd(l — /3), 
so that h/n^ converges stochastically to unity. This implies that 

uniformly in every finite interval, whence converges stochastically 

to Ck, which completes the proof of the lemma. 

(9.4) Lemma: If the symmetric cumulative F{x) has a density f(x)^ andif constants 
Cl and cn exist such that 

fix) > min (ci , Cj I a: 
where X = 1, i, • • • , then, if a 9 ^ 

lim|n~^'^K„(a)l > 0, 

Proof: According to theorem (8.4) there are constants di and dj such that the 
density of Ck is bounded by min (di, d 2 \x Hence 

Fjx) - dF(X) (x) 

1 - 

is monotone when p = min (ci/di, CzM), and hence is a distribution function. 
By lemma (9.2) the a-points of F lie outside those of pF(K-\ix) + (1 — d)l(ic)) 
and these, by lemma (9.3), increase at least as fast as ArT^^^. 

(9.5) Lemma: If the density of D exists and equals fix), and if either 

Urn |a:l‘+V(^) > 0, 

*“++00 


Urn l®|‘+V(a:) > 0, 

where X = 1, i, J) • • * , then, for m > aj, 

lim n-^'^[K„i<xi) ~ K^ia^)] > 0. 
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Proof; Let Di and Dj be independent with the distribution of D, Then 
Di — Di has a symmetric density given by 

g(x) = f f(x + s)f (a)ds. 

J—eet 


If 


I»> 0, 


then for suitable h and « > 0, 

fix) > e I a: for all x > h. 

Therefore, for a: > 0, writing 7 = — (1 + X)> 

gix) > I f(x + s)f(s)ds > e’ j h + 1 + a: P I h + 1 r = 1 bj 4- a; r. 

Now 

bi 1 bj + a: P > min {bi2'^6J', bi2'' 1 1 p} 
and hence, for a; > 0 and suitable Ci > 0 , Cj > 0 , 

gix) > min {ci, Cs I X P). 

Since gix) is symmetric, this is also true for x < 0. If 

Urn I x p+V(x) > 0, 


then a similar argument proves the same result. 

Let Kj„(a) be the «-point for the sum of n values of Pi — Pj and Knia) be 
the a-point for the sum of n values of P. The most elementary relation be¬ 
tween these functions is 


1 ± Mm - Aif) 1 < 1 KM) - KM.) |. 

To see this, observe that the sum of a sample of n values of Pi — Pa is the 

difference of the sums of two independent samples of n values of P, and that 
there is a probability of (ai — that both of these sums will fall between 
K„(oii) and KM.)- Thus the intervals (— j fC„(ai) — KM.) |i 0) and (0, 

1 KMi) - Kniai) 1) are each occupied by the difference with probability 
> Mm) - Since KsM) = 0> fbe relation follows. Hence, if ai > «*, 

Urn n-''^{JC„(ai) - KM.)] > ± M«i “ «0®) 

and by lemma (9,4) applied to the distribution of Pi ~ Pj this latter Ito jg 

positive, which completes the proof of the lemma. 

With the ground prepared, it is now possible to complete the 
Proof of the theorem; Let h be chosen so that 

lun I -r 1^+’' CPCr 4- h\ _ -x. O 
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This can always be done, if Z is replaced by — Z when necessary. Let U have 
the uniform distribution on the interval (0, 1) and consider the variable D 4- hU. 
This variable has a density given by 


and, therefore, 

lira > 0. 

a-»-foo 

Let jFtnCa) be the a-point for the sum of a sample of n values of D, and let K*{a) 
be the a-point for the sum of a sample of n values of D + hU. Since | /if/ | <h, 
it follows that 

I Knia) - Ktia) I < nh. 

Therefore, if 1/X > 1 aird aj > ai, 

limn-"'^ {K„ (ai) - K. M) = lJmn-‘'' {Jf*„ («0 - (a,)\, 

*“+00 *“+00 

and by lemma (9.5) the latter Ihn is positive. 

The case of X = 1 requires a slightly more delicate argument. 'I'lic sutu of 
a sample of n values of hU is asymptotically normally distributed, and hence 
it is less than Af/n\ for a suitable Ap , with probability /9. Therefore 

Kni«P) < Kliaff) < K„{a) + 4„n‘ 

and the same process yields the desired conclusion. 


10. A distribution with very long tails. A somewhat pathological e.xample 
is provided by the symmetric cumulative 

^ ln(e^+ |a;|)> 


^ ln{e^ + 


which has the density 


f(x) = 


(cH |a:|){Zn(e^+ |«|)}' 


Since 


lim I X = «> for all X > 0, 


it follows from theorem (9.1) that the distribution of the sum of a sample of 
n values of X spreads out faster than any power of n. The same must therefore 
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be true of the mean of a sample of n. There is clearly no use in taking any 
kind of mean of such a sample. 

There will, of course, be somethmg to gain by taking the median of a sample 
of 2n + 1, since the distribution of the median always shrinks together as 
71 —> and whenever, as is true here, the density is finite and continuous 

at the population median, the distributions of the sample medians shrink toward 
the population median 

This does not prevent some pathology, however, since the cumulative for 
the median of 2n + 1 takes the form 

J^(F(x)r«ii + p(W)), 

where P{t) is a polynomial of degree n with no constant term. Thus, for large 
negative values of x, the cumulative for the median is as}anptotically 

(2n + 1)' 1 

(n!)'(n-f D'We'+|a:|)r 
and the corresponding density is asymptotically 

_ (2n + 1)1 n _ 

(7il)V + l)N6” + |a:|)r+V + k|) 

and it follows that the median has no moments of any positive order, integral 
or fractional. This is true no matter how large the sample usedl 
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UNBIASED ESTIMATES FOR CERTAIN BINOMIAL SAMPLING 
PROBLEMS WITH APPLICATIONS' 

By M. a Gikshick, Fbedbrick Mosteller, and L. J. Savage 

U. S. Department of Agriculture; Stahstical Research Group, Princeton Univer¬ 
sity; and Statistical Research Group, Columbia University 

1. Introduction. The purpose of this paper is to present some theorems with 
applications concerning unbiased estimation of the parameter p (fraction de¬ 
fective) for san^iles drawn from a binomial distribution. The estimate con¬ 
structed is applicable to samples whose items are drawn and classified one at a 
time until the number of defectives i, and the number of nondefectives j, simul¬ 
taneously agree with one of a set of preassigned number pairs. Wlien this 
agreement takes place, the sampling operation ceases and an unbiased estimate 
of the proportion p of defectives in the population may be made. Some examples 
of this kind of sampling are ordinary single sampling in which n items are ob¬ 
served and classified as defective or nondefective; curtailed single sampling where 
it is desired to cease sampling as soon as the decision regarding the lot lieing in¬ 
spected can be made, that is as soon as the number of defectives or nondefecttive.s 
attam one of a fixed pair of preassigned values, double, multiple, and sequential 
sampling. In the cases of double and multiple sampling the subsamples may 
be curtailed when a decision is reached, while for sequential sampling the proc¬ 
ess may be truncated, i e an upper bound may be set on the amount of sampling 
to be done. In section 3 expressions are given for the unique unbiascil esti¬ 
mates of p for single, curtailed smgle, curtailed double, and sequential sampling. 

One or two of the illustrative examples of section 3 may be of interest beumufto 
their rather bizarre results suggest that some estimate other than an unbiased 
estimate may be preferable; but the discussion of estimates other than unbiased 
ones IS outside the scope of this paper. 

2 . The estimate For the purposes of the pre.sent paper the woid point will 
refer only to points in the zy-plane with nonnegative integral coordinates. 

We shall need the following nomenclature. A region if is a set of points con¬ 
taining (0, 0). The point (xj, y^) is immediately beyond (xi, yi) if either Xs 
a:i 4- 1) = 2/1 or X 2 = xi, 2/2 = Pi -H 1. A path in R from the point cfo to the 

point is a finite sequence of points ao, wi, • • ■ , «n sucli tiittt (x, {i > 0) is 

immediately beyond a,_i, and a, </i with the possible exception of ««. A 
boundary point, that is, an element of the boundary B of R, is a point not in U 
which is the last point ot„ of a path from the origin. Accessible points arc the 
points in R which can be reached by paths from the origin, wliile inaccessibk 
points are the points which cannot be reached by any path from the origin. 

1 This paper was originally written by Mosteller and Savage. A communication from 
M A Girshick revealed that he had independently discovered lor the sequential probability 
ratio test the estimate pia) given here and demonstrated its uniqueness. For purposes of 
publication it seemed appropriate to present the results m a single paper. 

13 
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All points are thus divided into three mutually exclusive categories: acc^ible, 
inaccessible, and boundary points. The index of a point is the sum of its co¬ 
ordinates, and the index of n region is the least upper bound of the indices of its 
accessible points. A finite region is a region for which the indices of the acces¬ 
sible points are less than some number ti. In particular a region containing 
only a finite number of points is finite. 

Paths may be thought of as arising by a random process such that a path 
reaching a. = {x, y), a, e R, will be extended to a ,41 = (s, 2 / + 1 ) with probability 
p or to = (»: -h 1, v) with probability g = 1 - p. We exclude p = 0, 1 
unless these values are specifically mentioned. When a path is extended to a 
boundary point of R the process ceases. It is clear from the definitions that for 
a finite region R, paths from the origin cannot include more points than n + 2 
where n is the index of the region This means that a path from the origin can¬ 
not escape from a finite region and that the probability that it strikes some 
boundary point is unity. It is clear that each path from the origin to a boundary 
point or an accessible point has probability p'^g*, if the point has coordinates 
(x, y). We will need the following statements which are immediate consequences 
of the discussion above: 

A. The proiability of a boundary point or an accessible point being included in a 
path from the origin is P{a) — k{a)p''(f, where k{a) is the number of paths from the 
origin to the point. W e shall call P{a) the probability of the point, 

B. For a finite region ^ P(«) = 1, i.e. the sum of the probabilities of the 


boundary points is unity 

Any region for which 23 P{°0 = 1 will be called a closed region. 

a tB 

Of course, all finite regions are closed; but it is convenient to have a condition 
such as that supplied by the following theorem guaranteeing the closure of some 
infinite regions as well 

Theorem 1. A sufficient condition^ that a region R be closed is that lira inf 


Mn)/'s/n = 0 , where A{n) is the number of accessible points of index n. 

Proof We consider the ascending sequence of finite regions , each con¬ 
sisting of the points of R whose indices are less than n. The boundary Bn of 
Rn can be written as the set theoretic union ii:„ U An, where is fl B, and 
An are the accessible pomts of R of index n. Hat Bn and Pn{ct) is the prob¬ 
ability of a with respect to R„ , it is easily seen that for a ( /v,,, P„(a) = P(a). 
Since every point of B is ultimately contained in the ascending sequence Kn , 


23 T(a) = lim 23 P(«) 

atKa. 


= lim 23 Pn (a) < 1, 

n-»M 


the inequality being a consequence of statement B. But 23 PM is mono- 
tonically decreasing because 23 Pn(a) is monotonically increasmg with n 
while 23 P.i(q!) = 1, from statement B. 

acBn 


2 If it IS desired to admit p = 0 ,1, the e.xistence of 
speotively must be postulated 


boundary points (a:,0) or {0,y) re- 
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If we can show lim X) -Pn(a) = 0 under the condition of tlie theorem, 

n—♦« ctCiln 

the proof is complete For any point at An, Pnioc) = k„{a)p''q” *' which for 
fixed p is 0(1/V^)- The sum over is 0(^(n)/-\/n) therefore sint’e the 
hypothesis of the theorem implies that A(n)l'\/n attains arbitrarily small value's 
for arbitrarily large values of n, the sum in question decreases rnonotonically 
to zero. 

CoEOLLAEY. If the number of accessible points of R of index n is hounded, the 
region is closed. 

That the condition given in Theorem 1 is not a necessary condition may be 
seen by examining the region R consisting of all points except points of the form 
{2x + 1, 2i/ + 1) and (3, 0) and (0, 3). 

Theoeem 2. If R is closed and R contains S, S is closed. 

Peoof. The proof is essentially similar to that of Theorem 1, 

Any reasonable estimate ofpwillbe a function defined on the boundary points, 
because the boundary points constitute, so to speak, a sufficient statistic for p. 
That is, the probability of any path from (0, 0) given the boundary point a at 
which it terminates is independent of p, and is in fact 1/A:(a). 

We shall construct an unbiased estimate of p for closed regions R, that is a 
function p(a), atB, such that p(a)P(a) = p (absolutely convergent).* 

oc tii 

CoNSTEUCTiON. Let k*(a) be the number of paths in R from the point (()> 1) 
to the boundary point a, and lei p{a) = k*(,a)/k(,a). Wo remark that the dcfmi* 
tions imply A:*((0, 1)) = 1, when (0, 1) is a boundary point. 

Theorem 3. For any closed region R p(a) is an unbiased estimate of p, 

Peoof: 


Z p(«)P(a) 


= z 


A;* (ft) 

A:(a) 


k{a)p''q^ 


= Z &*(a)p‘'5^ 

a eB 


If (0, 1) is a boundary point, then A;*((0, 1)) = 1 and k*{a) - Q, a (0, 1), in 
which case the sum in question consists of the single term p. If (0, 1) is not a 
boundary point, consider the region R' obtained by deleting (0,1) from li, and 
k'{a), the number of paths in R' from the oiigin to the boundary point a of R. 


z 

atB 


k*(a) = k(a) - k'{a) 

k*ia)p''g^ = Y, k(a)p''q^ - E A:'(ft)p%i' 

atJi ntB 


= 1 - ZA:'(ft)p''9*. 

atB 


Now R' is closed (Theorem 2); except for (0, 1) every boundary point of R' is 

“ Even if such a sum were p for a region wliich was not closed, we would not call the 
estimate an unbiased estimate 
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easily seen to be a boundary point of fi; and h'((x) vanishes except for the bound¬ 
ary points of R'. Therefore 

P +12 = 1, 

a tB 

and the proof is complete. 

It is clear from the construction that 0 < ^{a) < 1; this is rather satisfying, 
since an estimate of p outside of these bounds would be received with some mis¬ 
givings. 

Theorem 3 may be generalized to yield unbiased estimates of linear combina¬ 
tions of functions of the form p provided the points (u, t) are not inaceeasible 
points We need only let the point (u, t) play the role of (0, 1). Even though 
the point {u, i) is inaccessible it may be possible to represent p ‘q" as a polynomial, 
none of whose terms correspond to inaccessible points. 

It is clear from Theorem 1 that pia) is an unbiased estimate of p for the usual 
sequential binomial tests, but the computation may be quite heavy. It should 
be noted that the coordinate system used here differs slightly from the coordinate 
system customarily used in sequential analysis. The custom is to let the x 
coordinate represent the number of items inspected, whereas we use it to repre¬ 
sent the number of nondefectives, this is the only difference between the co¬ 
ordinates. We understand that m applications the customary iirocodure seems 
preferable, but we find the present coordinates more convenient for the purposes 
of ths article. 

In general p is not the only unbiased estimate of p. A necessary condition for 
uniqueness is that the region be simple, that is that all the points between any 
two accessible points on the Ime a: -f y == n be accessible points. In other 
words no accessible pomts of index n shall be separated on the line x + y = n 
by inaccessible points or boundary points. 

Theorem 4, A necessary condition that the estimate p he the unique unbiased 
estimate for the closed region R is that R he simple. 

Proof. For a region that is not simple we shall construct a function m(a) 
not identically zero, such that 


(1) 2^ w(or)P(a) = 0. 

o«B 

p(a) -f- jn^a) will be an unbias6d estimate of p differGnt from p. 

Suppose we have a closed region R which is not simple. We consider the 
west mdex n where the accessible pomts are separated. There vdll be at least 

that ^ sequence of pomts between some pair of accessible points 

hat are not accessible points. It is easy to see that all the points of this un- 
mterrupte^d sequence are^boundary points of R. Let this sequence be the points 

of m(a)°let mfl) ^ To begin the construction 
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the number of paths from a' to the boundary point a with the same convention 
if a! IS a boundary pomt. To complete the construction of mia), let m(a) = 
— [Z'(q;) + { — for boundary points not members of the setiuence 

under consideration. Before proceeding to check equation (1), we show that 

(2) Z I'M V" ; Z V‘{a) e-K 

^ ^ otB atB 

Because of symmetry we need only carry out the demonstration for tlie first sum. 
If a' is a boundary point l'(oi') = 1, and for all other points a I'ia) = 0, and the 
sum IS the single term If a' is not a boundary pomt consider the region 

obtained by deleting a' from R and the corresponding A:'(a), the number of paths 
fiom (0, 0) to the boundary pomts of the new closed region R'. Every boundary 
of R' except a' is a boundary point of R Let us extend the definition of k'{a) 
to the whole boundary of R by defining k'(a) = 0 for a not in the boundary B' 
of R'. Then it is easy to sec that 

k{a) = k'{cc')l'{a) + k'M. 

Now 


1 = Z^w/ 

<xtD 

= k'{a') Z V{a)v''q^ + E h'{a)v'‘q^ 

aiD aiB 

= k'M) E Z'Wp"?* + 1 - 


estabhshmg equation (2) 

We now check that m{<x) satisfies equation (1): 

E m{a)k{a)p''q^ = E (- 1 ^ - E - E {-lY l"ic()p''q^ 

l-o aiB ate 


jaaO 

= P''”9^‘’“‘(z (-lyp'g'"' - g'-*-^ - (-DV'"-') 

= 0 . 


Theorem 6 . A necessary condition that p{a) be a unique unbiased cslfmntr of p 
for the closed region R is Dial there be no closed region /£' whose boundary is a proper 
subset of the boundary of R. 

Proof. Again supposing that the condition is not satisfied we shall con.stnirt 
a function w(a) not identically zero such that equation (1) is salisticil lA>t 
k'y) be the number of paths in R' to « in 5 of R, understanding, of course, that 
k (a) = 0 if a IS not in S' of R'. Consider m(«) = 1 - A:'(«)/fc(«). m(«) is not 
mentically zero because k (a) vanishes for at least one a, liut kia) does not. 
From the closure of R and R' it is obvious that ?R(a) satisfies equation (1). 
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Two simple examples wOl suffice to show that neither simplicity nor the 
condition of Theorem 5 is alone sufficient to insure the uniqueness of p. The 
region consisting of the points whose coordinates are given in the following con¬ 
figuration and whose boundary points are 

a: 

(0,3) x 

(0,2) a: 

( 0 , 1 ) ( 1 , 1 ) ^ ^ ^ 

(0,0) (1,0) (2,0) (3,0) a: 

indicated by the s’s satisfies the condition of Theorem 5 but is not simple. On 
the other hand the region consistmg of all points for which < 3, except for the 
two points (1, 0), (1,1) is simple but does not satisfy the conditions of Theorem 5, 
because the region consisting of all points except (1, 0) with j/ < 3 can play the 
role of B'. 

The authors are unable to decide whether the two conditions together guaran¬ 
tee the uniqueness of p as an unbiased estimate of p, and supply the following 
sufficient condition which is adequate for many practical purposes. 

Theorem 6. A sufficient condition that a closed region have p(a) a unique un¬ 
biased estimate of p is that the region be simple and that there exist g,h{0 < (7, ft S 1) 
such that for all boundary points \gx — hy\ < M. 

Proof. If there were an unbiased estimate of p different from p, subtracting 
it from p would yield an equation of the form (sum absolutely convergent): 

(3) Z m(a)p‘'g* = 0, 

tt iB 

where m(a) is not identically zero. But this will be showm to be impossible.* If 
m{a) were not identically zero, there would be an ao such that ?n(ao) 0 and 
1) m{a) — 0 for aU boundary points of index less than that of cco , and 2) one of 
the coordinates of ao is less than the corresponding coordinate of any other 
boundary point for which m(a) 0. This follows easily from the simplicity 
requirement which implies that the boundary points of index n are broken into 
two sets o) those -whose y coordinates are less than the y coordinates of the ac¬ 
cessible points of index n, and b) those whose x coordinates are less than the z 
coordinates of the accessible points of index nf Since the situations a) and b) 
are symmetrical we suppose without loss of generality that ao is a boundary 
point whose y coordinate is less than that of any other boundary point ivith 
m(a) 7 ^ 0. Equation (3) may be written 

(4) m(a„)p''"2*»-I-= 0, 

* It Will be seen as the proof proceeds that if there are no boundary points to which, 
alternative a) applies, the restriction p > 0 may be removed and replaced by (; ^ 0, simi¬ 
larly if there are no boundary points to which b) applies the condition h > 0 ranv be re¬ 
placed by a 0. 
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where the exponents appearing in the sum are nonnegative. But it will be shown 
that for sufficiently small p 

^5^ 3*" I m(ao) I > p 1 S | , 

which contradicts equation (4). Now 

(6) I Sm(a)p''-‘'»-V I < S 1 m(a) | 

< S I m(a) I pV-^><‘-^q^-f'>‘o+h+u+a>^hv)/e 

= I mid) I (pg*/')*'-*'"-' 

< I m(a) I 

where all the summations range over the values indicated in (5). The summa¬ 
tion indicated in (5) is thus seen to be dominated by a convergent power seriee 
in pg*'^". 

Thus Theorem 6 shows that p is a unique estimate for the sequential binomial 
teats. 

Theorem 7. A necessary and sufficient condition that p be the unique unbiased 
estimate of p for a closed finite region R is that R be simple. 

Proof. The proof follows immediately from Theorems 4 and 6. 

3. Applications and illustrative examples. 

A. Single sampling. In single sampling a random sample of n items is drawn 
from a lot containing items each of which is either defective or nondefective. It 
is customary to estimate p, the proportion defective by the unbiased estimate 
i/n, where i is the number of defectives observed. The boundary of the region 
defined by a single sampling plan consists of all points of index n. Now 

k{{n — i, i)) = and fc*((n — i,i — 1)) = {^ _ Consequently the unique 
unbiased estimate of p is 


the result above. 

It may be of interest to note that an unbiased estimate of the variance pq/n 
of the proportion p, is (: :;)/[(:>] - ^ i estimate 

is obtained by the method suggested immediately following Theorem 3. 

B. Curtailed single sampling. In single sampling schemes, there is usually 
given a rejection number c as well as the sample size n. If c or more defcctivM 
are found in the sample the lot is rejected, but if less than c defectives are found 
in the sample the lot is accepted. It is customary to inspect all the items m 
the sample even if the ^al decision to accept or reject the lot is known before 
the completion of the inspection of the sample. One reason sometimes men- 
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tioned for this procedure is that an unbiased estimate for p is not known when 
the inspection is halted as soon as a decision is reached. We provide the un¬ 
biased estimate in the following paragraph 

In curtailed single sampling the boundary points when rejectmg are (x, c), 
c -h X g B, when accepting {n — c ij), y ^ c — 1. The region is a rec¬ 
tangular array and obviously simple. The unique unbiased eslimato along the 
horizontal line corresponding to rejection with c > 1 therefore is 


p{{x, c)) 


C;-r)/Ct:r 




c — 1 

C + X - r 


or in words, one less than the number of defectives observed divided by one less than 
the number of observations. The unique unbiased estimate along the vertical line 
corresponding to acceptance for c > 1 is 


p((n 




c -f- i 


that IS, the number of defectives observed divided by one less than the number of ob¬ 
servations. We reserved the case c - 1 because it is rather illuminating. The 
construction of Theorem 3 works as usual, and we note that p((0, I)) = 1, 
pifn, 0)) = 0 as Ave might expect, but p((x, 1)) = 0, 0 < x < n. 

It IS somewhat startling to find that the only unbiased estimate of p for cur¬ 
tailed single sampling with c = 1 provides zero estimates unless a defective is 
observed on the first item We remark that the variance of this estimate is pq. 
In other words, curtailed single sampling with c — 1 is no better for estimation 
purposes than a sample of size one when the unbiased estimate p is used, 

A limiting case of curtailed sampling Avhen n is unbounded has been con- 
sideied by Haldane as a useful technique in connection with estimates of tlie 
frequency of occurrence of rare events The region would not be closed unless 
p - 0 were excluded In our nomenclature there is a "rejection number” c 
(c > 1), and Ave continue sampling and inspecting until c defectives have been 
observed. The unbiased estimate' is (c - D/y - 1), where f is the total num¬ 
ber oi observations, and of course this is the estimate given by Haldane, 

C A general curtailed double sampling plan The folloAving example Avill 
Illustrate the sort of calculations involved in computing p for multiple and se¬ 
quential plans. A sample of size ni is draAvn and items are inspected until 1) 
fi J fwl defectives are found, or 2) bi - a + 1 (a ^ 0) nondcfectlvos are 
found or 3) the sample is exhausted with neither of these events occurrmg If 
case 3) arises, a second sample of size n, is draAvn and inspection proceeds until 
a grand total of nin ^ r 2 ^ m) defectives is found or m na - r, -f- 1 


' J B, S, Haldane, Nature, Vol. 155 (1945), No. 3924. 
For the uniqueness, see footnote *. 
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nondefectives are found, In this scheme we call n and rejection luimlHirs 
and a an acceptance number. The unique unbiased eslimate p is asJoUows: 


(a) p(0'. n)) = j j - 0, 1, • ■ • , ni ri ; 

(b) P((ni - fl + 1, f)) = -a + i* 1 = 0,1, • • • , c 


(c) piix, ri)) = 


A 


Xq + ya — 
Xa 


,/xo yo\f ■ 

\ ^0 A 


iV X — iCo + rj — J/d — iN 

A ra - j/o — 1 / 

rT' 


X — Xo + J-a J/o — 
ra — yo 


fli — ri < X ^ ni + ; 


Xo + 2/0 — 1 \/ni + Ha — fa + 1/ “ po “ Xo 


(d) p((ni + na-ra + 




y - Vi 


) 


(" I “‘A 


Hi + Ha ~ ra + 2/ ~ 2/0 ~ XoA 
2/ - 2/0 / 
a < 2 / ^ + TiJ > 


a/i/ierfi summations extend from 2/0 = o + 1 2/o — ” 1, and xo + Po «= ni < 

In the above equations (a) and (b) are the estimates corresponding to rejection 
and acceptance on the basis of the first sample, while (c) and (d) correspond to 
rejection and acceptance when a second sample has been drawn. Itnthor than 
use the sums indicated in (c) and (d), some may find it preferable to make the 
estimation entirely on the basis of the first sample. If there is no curtailing, 
the procedure of estimation is equivalent to single sampling, and the estiinate ia 
again i/ni as mentioned in paragraph A above. If the first sample is curtailed 
and the estimate is made on the basis of the results of the first sample only, the unieiue 
unbiased estimate is given by formula (a) when rejecting, by formula (b) when ac¬ 
cepting, and by i/ui when a second sample is to be drawn. It will be noted that 
(a) and (b) are identical with the expressions derived in paragraph B over the 
range of values for which they arc valid. 

D The sequential probability ratio test. Using the nomenclature of sequential 
analysis,’' the criterion for a decision is given by two parallel straight Units in the 
dn-planc 

(7) di = hi + sn (lower line) 

da = hj + sn (upper line), 

where d is the number of defectives and n is the number of observations. The 
acceptance and rejection numbers for any n are given by a„ and r„ , respectively, 


I See, for example, Sequential Analysis of SlalisHcal Data: Applications, Section 2, 
Columbia University Press, 1946 
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where On is the largest positive integer less than or equal to di, and r„ is the 
smallest integer greater than or equal to ds. We let ka{n) be the number of 
paths from the origin which end in a decision to accept on the nth observation; 
kt{n) is similarly defined when reiection occurs on the nth observation. We 
also require an auxiliary sequential test with acceptance and rejection numbers 
a'n-i = o„ - 1, r„ - 1 (which is equivalent to replacing h and in by hi + 
1-5 and /i 4 - 1 + s in the equations (7)), and with Kin) and fcr(n) the number 
of paths from the origin which lead to acceptance or rejection on the nth observa¬ 
tion for the new teat. A graphical comparison of the two plans shows that: 
The unique unbiased estimate oj p is 

pin) = fc'(n - l)/kain) 
when the original test leads to a decision to accept, and 

pin) = k'rin - 1)/Lin) 

when the original test leads to a decision to reject on the nth observation. 

E. Regions with narrow ihrnais. Let us consider the case of a closed region 
which has only one accessible point of mdex n, n > 0 (n being the lowest index 
not zero at which this phenomenon occurs). The number of paths from the 
origin tb this accessible point a' we will denote m, while the number of paths 
from a' to a, boundary points of index greater than n, will be denoted lia). 
Then the total number of paths to a from the origin is ml(a). We use the con¬ 
struction preceding Theorem 3 to get p(a). The number of paths from (0,1) to 
a IS similarly m*lia), so for such points p(a) = m*/m. In other words, if a 
closed region has a narrow throat such as that described, p(a) for a of index 
higher than that of the accessible point are independent of the shape of the 
region beyond the line a: -|- y = n, and in fact they are all identical. The cur¬ 
tailed single sample with c = 1 is a particular case of a region with a narrow 
throat. 


4. Estimation based on data from several experiments. In the previous dis¬ 
cussion we have been concerned with estimation based on the result of a single 
experiment. Vanous kinds of acceptance sampling plans have been suggested 
as examples of the possible experiments. Acceptance sampling is one of many 
activities where data toward the estimation of p are often accumulated in a series 

t ZSiT f '^hen information 

s available from several experiments the estimate p will no longer be the unique 

rveTsrn,! 7 e^^^^ents, but to illustrate the point, we will discuss 
a veiy simple example m terms of acceptance sampling 

the 1“^ 'f" "0 “Wording to 

second obaewefi n ^pling plan; if a defective occurs at the first or 

it.™ 
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The total number of defective and of nondefective items in the two samples 
form a suflB.cient statistic for p. In a single application of the samiiling plan 
the boundary points with their probabilities are (0, 1), p; (1, l)j P9> (2, 0), g®. 
From this information we can generate the possible totals of defectives and of 
nondefectives which may arise when samples are drawn from two lots, with their 
probabilities by expanding 

(8) (p + pg + = p^ + pV + g‘ + 2p*g + 2pg’ + 2p(^, 

where a term on the right of the form mp'^^ is the probability that in two samples 
there will be x nondefectives and y defectives altogether. On the basis of the 
observed number pair (a:, y), which may be regarded as a possible terminal point 
a for the two experiments performed successively, we wish to form an unbiased 
estimate e((x, y)) = e{a). For the estimate e to be unbiased the condition 
S e(a)P{a) = p must be satisfied, where in the present example the P(a) are the 
six terms on the right of equation (8), and the e{a.) are the estimates with which 
the six probabilities are associated. 

In the example under consideration the condition for unbiasedness will be 
satisfied if and only if e((0, 2)) = 1, e((4, 0)) = 0, e((l, 2)) = c((2, 1)) » 

[1 - e((2, 2))]/2, e((3,1)) = e((2,2))/2. Consequently a one parameter family 
of unbiased estimates is available. Unfortunately the popular condition that 
the variance be a minimum depends on the true value of p; in fact the variance 
is minimized just when e((2, 2)) = 1/(2 + p). So an unbiased estimate of unh 
formly minimum variance does not exist. In praetical applications to accept¬ 
ance sampling one might meet this difficulty by choosing a value of p near zero 
for such a minimization scheme. 

However it is clear that the last word has yet to be said about how best to 
estimate p when one is faced with the results of several experiments, 

6. Conclusion. We would like to call attention to a few problems raised by 
but not solved in this paper: 1) find a necessary and sufficient condition that f 
be the unique unbiased estimate for p; 2) suggest criteria for selecting one un- 
biased estimate when more than one is possible; 3) evaluate the variance of fi. 

In this connection, m a forthcoming paper by M. A. Girshick, it will l>e showm 
for certain regions, for example for those of the sequential probability ratio test, 
that the variance of p{a), 

<^l > pq/Eix 4- y), 

where E{x -f y) is the expected number of observations required to reach a 
boundary point. 



DISTRIBUTION OF SAMPLE ARRANGEMENTS FOR RUNS 
UP AND DOWN 

Bv P. S. Olmstead 
Bell Telephone Laboratories, Inc. 

1. Summary. Using the notation of Levene and Wolfowitz [1], a new 
recursion formula is used to give the exact distribution of arrangements of n 
numbers, no two alike, with runs up or down of length p or more. These are 
tabled for n and p through a = 14. An exact solution is given for p > n/2. 
The average and variance deteimined by Levene and Wolfowitz are presented 
in a simplified form. The fraction of arrangements of n numbers with runs 
of length p or more are presented for the exact distributions, for the limiting 
Poisson Exponential, and for an extrapolation from the exact distributions. 
Agreement among the tables is discussed 

2. Introduction. Assume that 


) ^2 f * * * 


represent a series of repetitive measurements. In engineering work, experience 
has shown that, when the values of these measurements exhibit changes in level, 
trends, cycles, etc., it is usually indicative of the presence of findablo CJiiises, 
In general, the engineer becomes more confident that n findable cause oxist.s 
for a change in level, a trend, or a cycle, when the change is largo, the ti-end is 
long, or the cycle is regular. 


On the basis of this experience, the engineer selects particular measures of 
change m level, length of trend, etc., to guide him in deciding when it is profitable 
to look for a cause. Having selected the measure, he is interested in knowing 
how often he may have to look for a cause that does not exist. One such measure 
is the length of the longest run up or down m a sample of n values. The chart 
in Figure 1, based on the analysis given here, applies when no two values are 
alike and indicates the fraction of all nonidentical arrangements that have 
runs up or down of length p or more. 


Attention is directed to the distribution of sample arrangements that have at 
least one run up or down of length p or more. The distribution and the vari- 
^ces and covariances for lengths of runs up and doAvn are given by Uvene and 
Wolfowitz [1] In addition, Wolfowitz [2] has shoivn that the limiting distribu- 
tion for a particular length of run up or down is a Poisson Exponential. 

The notation of Levene and Wolfowitz [1] will be used. Thus, let ai, a, 

• • ■ , On be n numbers, no two alike, and let the sequence S = (h, h h) 
te any permutation of ai, , • ■ ■ , a„, where S is to be considered’a change 

variable, and each of the n! permutations of m , a,, ■ • , a„ is assigned the same 
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probability. Consider the derived sequence R wliose tth element is the sign 
( 4 . or-) of li.+i -/i,, (z = 1, 2. ••,«-!) A sequence of p consecutive+ 
signs immediately preceded by a — sign is called a run up of length p or more; 
a sequence of p consecutive — signs immediately preceded by a + sign is called 
a run down of length p or more. When such a run is both immediately pre(H'dcd 
and immediately followed by an unlike sign, it is a lun of length exactly p. 
The distribution of arrangements with at least one run up or down of length 
p or more is considered under five specific headings: 



Fig. 1 


1. An exact numerical solution for n small, i.e., computations have been 
completed up to and including n = 14. 


2 . An exact solution for p > -. 

3. A limiting solution for ' 


= constant. 


n 

4. An extrapolation from n small. 

5. Constant probability relationships. 


3, Solution for n small. Starting with a single number, aj, a second numb(‘r, 
02 > tti, may be placed before or after it to obtain the tw'o independent arrange-' 
ments of one run of length exactly 1 . A third number, flj fls > aj, may Iks 
placed before, between, or after the preceding pair to obtain two independent 
arrangements of one run of length exactly 2 and four of two luns of length ex¬ 
actly 1 . Continuing this process it is seen that, on the assumption that the 
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distribution of independent arrangements for (n - 1 ) numbers, ai < Ch < a, < 
• • • < a„_i, is known, the distribution of independent arrangements for n 
numbers, ai ’< 02 < o, < • • • < a„, can be found by using the following re¬ 
cursion formula: 

Fn[rn-1 , rn-2 i ' ‘ ' I I ' ' ' l ^« ) ' ‘ ' I I ' ‘ ) ^l] 

= 2 (ri-l + , ^’n-S , ' ' ’ l (»■< “ 1), ('".-I + l)l ' ' ' Tl] 

.-2 

"b 2 Fb— l[rn-J 1 2'n-S ) ■ ■ ■ 1 (^1 1)1 

/1\ „_3 i -1 

+ 2 L E (»■* +1) 

t *«2 j —1 

• Fn-i[i'ri-it ' ■ ‘) (Xk-i+j H" l)j • • •, (r< ~ ]), I {fj 1), • ■ •, (ri 1)] 

+ E (n + l)Fn-i[r«-a, • • •. (r». 2 . + 1), ■ • • , (r. ~ 2), • ■ • {r, - 1)] 

4-1 


where r,, etc., represents the number of runs either up or doum of exactly length 
i in each arrangement of the n numbers designated F„ , 

( 2 ) srJiV, = r, the total number of runs having lengths exactly i (from 

1 to n — 1 ) for each arrangement included in Fn, 

(3) = n — 1, that is, the sum of the lengths of all such runs in any 

arrangement is one less than the total number of 
numbers, 

F nirn—I , Tn —2 j ' * ‘ ***»^y> ***) ^ll> 

the total number of nonidentical sequences of the n 
numbers with exactly r„_i runs'-of length exactly (n — 1 ), 
••• n runs of length exactly h, ■ • • runs of length 
exactly i, • • • ry runs of length exactly j, ■ • ■ ri runs of 
length exactly 1, Some of these r’s are of course zero 
and their sum is that given in ( 2 ) above. Similar 
statements apply to the four Fn-i’s. 

In the last two summations in ( 1 ), when r# = ri, (r, - 1 ) combines with 
(ri — 1 ) to give (n — 2 ), and when r, = n, (ry — 2 ) combines ivith (ry — 1 ) to 
give (fi - 3). 

By using the above recursion formula, the exact number of arrangements with 
at least one run up or down of length p or more has been computed for n ~ 2 
to n = 14, inclusive. This information is given m Table 1. In addition, it 
has been used to determine the probabilities of arrangements with runs up or 
dow of length p or more as shown in Table 2 . These tables provide a useful 
background for the limiting expressions considered in the next three sections. 



TABLE 1 

Exact Numbers of ATrangements of n numhers with Runs of Length p or More 
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n n 

4. Solution for ^ . When p > ^, itis clear that no seeiufiict! cun contain 

more than one run of length p. Thus, the expected numl)er of runs of length 
p or more in an arrangement is also the probability that an arrangement contaiiiH 
runs of length p or more. Writing Levene and Wolfowitz’s [1] exprewion (4.21 
in the simplified form previously published (3], we have 


(4) PiO = E{r',) = 


for 


^ P < n, 


where r'p represents the number of runs of length p or more*. This cxprcKsion 
checks exactly with Table 2 over the range to which it applies, 


6 . Solution for 

n 


= constant. 


As mentioned above, Wolftwit?„ *,2} 


has shown that the limiting distribution for runs up and down iR a Poi.^Ron 
Exponential. His proof applies specifically to the distribution of nin« of length 
exactly p. However, the assumptions made in his derivation I'ouhl liave 1 k‘C‘ii 
applied to the distribution of runs of length p or more and woulfl have led to 
identical conclusions for such. run,s. To see how closely this is approxininfcd, 
It IS possible to throw expression (4.17) for the variance of (r'p) derived by I>t<vcmu 
and Wolfowitz [1] into the followmg simplified form: 


= / gKn - p)(p + 1) + 1] r, 2(p -H 1)' [Op’ + 7(p - 1)1 

I (P + 2)I L (p + 2)!(2p + f)(2p+ 1) 


(5) 


. 4(p + 2) 
(2p + 3)1 


-1 + r (p +i)[(2p+3)p(p - 1) 
u L’pKp + 2)l(2pTtr3)(2^ 1) 


6 ) 


+ 


2 (p+l)Hn l ■ rpw,, r, 
(2p + 3)! J/ 


-1 


+ 


1 


i.. 4- 1 

L(p!)’ (2p)l. 


J. _ P 

Thus, a (r ) is equal to E(rp) within one part in one thousand for p > 7 and it k 
first two moments approximate those of a Poisson Hximtum- 
hal Makmg use of tbs mformation, it is possible to prepare Table 3,\vhit'h 


( 6 ) 


P(r.) 


1 - e 


= 1 — e 


tomparima of Tables 2 aad 3 shows apoemont to olosor than .0001 f.,r p > u, 

closer agreement may be expected as p is increased. ’ ^ ^ rndicatmg that 
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6, ExtfflpolatioD from the exact solution for n sihelU, Since the exponential 
in pcitiafinn {(!) may l*p written in (he form: 

( 7 ) r «• 'p«i' ^ , g“t5fl(p^l))/tp+2)| 

it follows? (hilt; 


1 ~ P.irl,) 


TAIiLK 3 

Fnrtutu «/ Afran^i-mFntg uf n numbcra with Runs nf length p or More Based on Poisson 

Kxpmtnlial 


Manv-mMvnwi <“11 

» 

1 

1 

A 

4 

s 

& 


8 

9 

10 

>10 


7.121 











■A 

Mil 


oirC) 









1 

«XW( 

itn.M 

.(MX) 

(«)2h 
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am 

i.'4'4:i 

.0105 

.(KKH 







ti 

■171J 

nni.’i 

. lino 

(Cgll 

.(KtiS 

0001 






*» 

' 


7:wi 

.31(57 

(H35 

.0062 

.(XXH 

.0000 





h 


7(U7 

SOfvl 

.nsfl? 

.0076 

.0007 

.0001 

.0000 




!t 


IMOI , 

.yi'w 


.0099 

.0011 

.(XX)1 

.0000 

.0000 



IP 

mh-d 

s'i’i 


,flS25 

.0122 

.0014 

.0001 

.0000 

.0000 

.0000 


11 

wrti 

‘XttO 

■13) 

.0933 

.0146 

.0018 

.0002 

.0000 

.0000 

0000 

.0000 

lU 


'.1211 

•iwa 

.1076 

.0169 

.0031 

.0002 

.0000 

.0000 

.0000 

.0000 

13 


'.m2 

4imi 

.130(1 

.0193 

.0025 

.0003 

.0000 

.0000 

.0000 

.0000 

11 


WM2 

52<« 

.1321 

■ffiie 

.0028 

.0003 

.0000 

.0000 

.0000 

0000 

i:» 

,1«W 

W13 

-5SK1 

.Hll 

.0239 

.0032 

.0004 

.0000 

,0000 

,0000 

.0000 

•M 

1 .WXK) 

!«W . 


.201a 

.0365 

.0049 

.0000 

.0001 

.0000 

.0000 

0000 

■10 



.ai(k? 

.3952 

.0803 

.0118 

.0015 

.0002 

.0000 

0000 

.0000 

ai 

U 

1 .(KXKl 

.il7W) 

.5419 

.1231 

.0186 

.0023 

.0003 

.0000 

.0000 

0000 

«) 

u 

t( 

.0012 

.55.30 

.1639 

.0354 

.0032 

.0004 

0000 

.0000 

.0000 

soo 

<1 

tl 

.91tR5 

.7371 

.2030 

.0322 

.0041 

0005 

.0000 

.0000 

0000 


It 

ir 

I.rXXK) 

.9345 

.3717 

.0652 

.0085 

.0010 

.(K)01 

.0000 

.0000 

««! 

" 

" 

it 

.9990 

0924 

.1677 

.0216 

.0024 

.0002 

.0000 

.0000 

um 


II 


1 00(K> 

.9066 

,2919 

.0428 

.OO'IO 

.0005 

.0000 

.0000 

fifxm 

.. 

ti 


ii 

1.0000 

,8234 

1070 

.0246 

.0025 

.0002 

.0000 


showing that couwcutive values of 1 “• P(r'^) arc related by a constant of pro- 
IKirticmality dt*iM?ndent only on p. Since this is true in the limit, Table 2 was 
exaniimitl to determine similar multipliers for extrapolation. The results of 
lliis cxaminatioii arc showm in Table 4 together with the values of (8) This 

1 " P (r^) 

table shows that the agreement between the value of ^ ~ 

e.g., and e^"5‘»**o)Kp-«)i becomes closer the larger the value of p. The con- 
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stancy of tliR ratio for a given value of p is such as to permit calculation of 
probabilities for any value of n to a minimum of three or possibly four decimal 
places. Huch cakulationa liave been made and recorded in Table 6. The fol¬ 
lowing frumuiae^ went used for these calculations: 

jf’nCK) » 1 

Pn(.n) ^ 1 - (.004373G4)^|y-» 

(9) P.irt) « 1 - (.45093729)(.92404)"-“ 

Pnir'i) •= 1 - (.87687019)(.98561)"-“ 

P«(rt) •« 1 - (.98060695) (.99760)"-“ 

P«(r,) » 1 “ (.99752014)(.999652)"-“ 

or in general 

(10) Pn(r;) » 1 _ (1 - P„,(r;)](Con8tantp]"-"". 

(lomparison of Table 3 with Tables 2 and 5 shows that the difference for given 
p and n. has a maximum for each value of p and that this maximum decreases 
with inereaw in p. The maximum values of the difference shown in the tables 
are.'p 1, w ■« 2, .2079; p =« 2, n 6, .1091; p = 3, n = 20, .0572; p = 4, w = 80, 
.OLM; p » 5, n = 500, .0033; and p = 6, n «= 6000, .0007. Thus, it is apparent 
tliat the fign-ement Iwyond p = 0 should be within .0001 and the method of 
Bection 5 uwd for Table 3 is satisfactory for these probabilities. 

7. Constant probability relationships. From Tables 2, 3 and 5, it is pos¬ 
sible to make interpolations for tlie values of n required to have a probability of 
at least P{r'p) that an arrangement will have a run of length p or more. When 
the conditions of Section 5 apply, the value of n is, of course: 

(11) n = p - p! log. [1 - P(r'j]. 


* It will be noted that the constant for p » 2 has been taken tobe-, whereas the last value 

T 

shown in Table 4 is .53601969. However, alternate values in this series are converging. 

2 

Comparing the«o subwriea shows that by n »• 16, the values would agree with - to eight 

V 

2 

decimal places. An analytic proof that - is the limiting value of the constant has recently 

if 

been fonnd by J. W. Tukey. 

While reading the manuscript J. Riordan observed that the number of arrangements 
with longest length 1, say/(n, 1) has the generating funotton, 

I/Cn, 1) ~ 2(860 (-f tan 1) 

nl 

hence is twice the Euler number for n even and twice the tangent number for n odd, a result 

given essentially by Netto (4). These observations lead directly to the limiting value, 

2 , . 

-noted above. 
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TABLE 6 


Fraction of Arrangements of n Numbers mth Runs of Length p or Mnrr Hmnl rm 
Extrapolation mth Extrapolation Constant 



Ml? 

.03.W 
■fwin 
.1241 
. i(ir.2 
.2044 
.3743 
. 11057 
. inm 
l.WMK) 


.m)2H 

.m:v2 

.fKHO 
.illlK 
,(Uh7 
.0255 
.0322 
.005,3 
. 15KO 
,2025 
.K241 


Sample Size for Constant Probability Based on Poisson Kxpimnttml 
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3 

1 
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! 

4 

® 1 

f) ' 7 . S 

i 

<■99 

7 

20 

71 
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1939 

132C>H 

<•95 

5 

13 

47 

219 

1263 1 

8033 " ; 

<.90 

3 

10 

37 

169 

971 

0037 , 

<.10 

0 

2 

4 

11 

49 

309 < 2200 ; 

<•05 

0 

1 

3 

1 7 

26 1 

1.53 j 1170 ' 10350 

<•01 I 

0 

1 

2 

4 

! 

34 235 1 2030 


» 


TABLE 7 



(i 


132.30 

KOH 

0022 

30B 

34 
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Similarly, it may be obtained from the extrapolation formulae of f3ection 6 
in the form: 

(12) n = Rfl + -PnCrp)] log [1 ~ Pnojfp)] 

log [Constantp] 

Results of computations based on (11) and (12), are given in Tables 6 and 7, 
respectively for particular values of P(rp). It will be noted that Table 7 is in 
exact agreement with Table 2 and that it differs but little in a practical sense 
from Table 6, 
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THE THEORY OF UNBIASED ESTIMATION 
By Paul R. Halmos 
Byracuse Vniuemly 


1. Summary. Let F(P) be a real valued function defincHi on a l/' ijf 

the set 3* of all probability distributions on the real line. A funpti<»ii / of ti r*'Eil 
variables is an unbiased estimate of F if for every system, .Yi, - ■ * , Xn , of indo” 
pendent random variables with the common distribution the 
of j{Xi , X„) exists and equals F(P), for all P in A newssary and wiffi- 
cient condition for the existence of an unbiased estimate is given (Theurr*m 1|, 
and the way in which this condition applies to the momenta of a di^tribnliEin is 
described (Theorem 2). Under the assumptions that thi.s condition is «itif«ded 
and that 2) contains all purely discontinuous distributions it i.s showm that 
there is a unique synametric unbiased estimate (Theorem 3); the most goneml 
(non symmetric) unbiased estimates are described (Thwirern -1); and it. In 
proved that among them the symmetric one is best in the sense of having the 
least variance (Theorem 6). Thus the classical estimates of the mean and the 
variance are justified from a new point of view, and also, from the theory, corn- 
putable estimates of all higher moments are easily derived. It Is ird<‘rc*Hting to 
note that for n greater than 3 neither the sample nth moment almut the ftaniiiln 
mean nor any constant multiple thereof is an unbia.sed estimate of the nth mo- 
meat about the mean. Attention is called to a paradoxical situation iirising in 
estimating such non linear functions as the square of the fimt moment. 


2 . hitroduction. Consider the set 3)* of all probability distributions on t!m 
real line. The elements P of ffl* may be regarded as either set funclimw P(E) 
defined for all Borel subsets E of the real line, (probability meosurcH) or mono- 
tone non decreasing functions P(s) of a real variable x, (cumulative diRtriluition 
unctions). Suppose that F = F(P) is a real numerically valiml fumdion of 
distnbutmns. For example F(P) may be the expectation or tho .steridmtl devia¬ 
tion ofthe distribution P, or it may be the amount of probability P nssigna to 

(stat^tic) of a sample of n from a population with distribution P. in such a way 
m P function is equal to the value of F{P} icUmtieally 

estimate ot ordern over a IS a real valued function/ » /(r,-...rj of « m/d 
vanables, which is such that for every system Y. Y^U- , 

dom variables with tha « j* ry system Xi, ■ ■, X, of mdefiendent ran- 

mates of a given function FfPt? t possible unliiauwl eati- 

given lunction F(P)? (HI) Is there a reasonable definition of -best 

34 
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unbiased estimate” which enables one to select from all unbiased estimates of a 
fixed function FiP) a unique best one?* 

I shall present below a complete solution of these problems, under the assump^ 
tion that the domain of estimation, S), is sufficiently large. The results also 
shed light on some classical concepts. It is possible, for instance, to exhibit 
computable unbiased estimates for all moments of a distribution about its ex¬ 
pected value, and to prove that the known estimates of the expectation and the 
variance are essentially unique. 

The vague concept of sufficiently large estimation domain 2) is easily made 
precise. For any Borel set E on the real line let 2)*(JS) be the set of all those 
distributions which assign the probability 1 to some finite subset of E. Thus, 
for example, if E consists of exactly two points then 3)*(F) is the set of all possible 
probability distributions in a dichotomy. A subset 3) of 3>* will be said to be 
finitely closed over E if 3)*(E) 3). Finitely closed domams are “sufficiently 

large.” 

It is clear that some restriction (from below) on the size of 3) is essential for a 
discussion of the characterization problem (II) and the uniqueness problem 
(III). For if, for example, the domain 3) is artificially restricted to contain 
only one distribution, then there will always be a plethora of completely im- 
related and uninteresting solutions of the problem of unbiased estimation, none 
of which can be said to bo preferable to any other one. It is true, however, that 
the assumption of finite closure is too restrictive. The general problems of 
unbiased estimation are still unsolved over such interesting and useful domams 
as the set of all continuous distributions, and the set of all absolutely continuous 
distributions. There are also more special problems connected with special 
classes of distributions (e.g. the normal and the rectangular distributions), as 
well as the general problem of characterizing the domains which are sufficiently 
large to make a uniqueness theorem possible. I hope to return to these problems 
in the near future, 

3. Existence, A function F(P), defined on a domain £D £ 3)*, \vill be called 
homogeneous over 9), of degree it = 1, 2, ■ • > , if there exists a real vjilued func¬ 
tion <p = ^(xi, - X/J) oi k real variables which is such that for every P in 9) 
the Lebesgue-Stieltjes integral’ 

j "■ j v»(a;i, - * • , JCt) dP(xi) < • • dPiXk) 


I My interest in those problems stems from conversations and eorrespondonoo with 
Reinhold Baer, who first called my attention to the problem of finding unbiased estimates 
for the moments about the expected value. The general questions of existence and 
uniqueness of unbiased estimates were raised explicitly by J. F. Sleffensen in a footnote 
on p. 18 of his book, Some Recent Researchet in the Theory of Stathties and Actuarial Science, 
Cambridge Univ, Press, 1980. 

s All integrals in this paper are to be extended over the entire Euclidean space of in¬ 
dicated dimension. 
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exists and is equal to F{P), and if the integer k is minimal with respect to the 
property of the existence of such a representation. 

Theorem 1. A necessary and sufficient condition that F have an unbiased esti¬ 
mate of order n over S) is that it he homogeneous over 3) of degree k < n. 

Proof. To prove sufficiency, suppose that 

F(P) = /••</ ^b(Xx ,••• , X*) dPM • • • dP(Xk) 


for all P in 9), with Jf, < n. Define / by 

/(a*! j ' ’ ‘ ) Xjfi , X^e^.! > * ' ' , Xn) ^ <ip(Xi j * * * , Xk ). 

Then if Xj, • • ■ , Zn are independent random variables with the same distribu¬ 
tion P (belonging to 9)) 

••• ,X„)} = / //(xi, • • • . x„) dP(xi) •. • dP(xJ 

= / " * / vixi, ‘ , Xk) dP(xi) ■ • • dP(x«) 

“/■'■/ > ' ■ ■ < dP{xi) ' • • dP{Xk) «■ P(P). 


The necessity of the condition is even more trivial: the dehnition of an unbia«‘rl 
estimate of order n is such that the existence of one is equivalent to homoifenpsty 
of degree < n. 

As a special case, and an important illustration of how the degree ia evaluatwi 
consider the moments P„ = P„(P) of a distribution P about the origin, ' 

K{P) = /x^dPfx), 

and the moments Fm{P) about the expected value Pi(P), 

^'m(P) = / (x - Fi{P))”' dP{x). 


Theorem 2. ^ 

BMh of the functions Pi ,•••,/>,, ana jimiely closed over { 0 , 1 ! 
denotes the set containing the two numbers 0 and 1 only), and if k 
arbitrary non negative integers, then the function 


If ^ is any subset of 3* contained in the domain of definition of 
• ,Fr, and finitely closed over ( 0 , H {where (0, 1 } 

’ • , Av are 


P(P) = pj'(p) • • • PJ'(P) 

ts hemogeneous over 3) of degree exaedy k = kx A- ‘k. 
Proof. The representation of P by a fc-fold integral, 


P(P) 


/■■■/ 


*1 • • ■ Xl, Xij+i 




dP(xi) ••• dPixk), 
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shows that F is homogeneous of degree < fc. That the degree of F is indeed 
equal to k is proved as follows. Suppose that 

F{P) ~ j j > *" > dP{xi) ’ • ■ dP(xi,) 

for all P in 3). Observe that if P is the singular distribution which assigns prob¬ 
ability 1 to the point 1 on the real line then the identity of the two representa¬ 
tions of F reduces to v>(li • • • , 1) = 1; similarly assigning the total probability 
to 0 implies that ip(0, • • ■ , 0) = 0. More generally, choose P so that it assigns 
the probability p, (0 < p < 1), to the point 1, and the probability g = 1 — 
p to 0. It follows that 

p'‘ = v’' + V' <Pi + • * * + ] 

where ip. is the sum of all ip(a;i, • • • , x^), over those ?ituples (xi, • • , Xh) which 
contain exactly i O’s and (h — i) I’a If g is replaced by 1 — p in the right 
side of the last equation, the resulting equation is supposed to be satisfied by 
all p, 0 < p < 1. If, however, h < k, then the two sides of the equation are 
polynomials of different degrees; hence h > k. 

CoHOLLAHY. If 3) is any subset of 3)* contained in the domain of definition of 
the function Fm and finitely closed over {0, 1} them Fm is homogeneous over 2) of 
degree exactly m and, consequently, it has unbiased estimates over S) of order n if 
and only if m < n. 

Proof. Since 


p„(p) = / (x - F,{p)rdPix) 

= Z7-0 (-i)'(^)pi(p) / x’"-’ dP{x) 

= Er-o 

the conclusions of the corollary are implied by Theorems I and 2. 

4. Symmetry. Theorem 1 may be regarded as a solution of the existence 
problem (I). An examination of its proof shows, however, that the estimates 
there constructed are very unsatisfactory indeed. In the special case F = Fy, 
for instance, the estimate becomes fixy , • • • , w„) = Xy. The first element of a 
sample of n is, to be sure, an unbiased estimate of the expectation of the dis¬ 
tribution, but it is intuitively clear that, since it ignores most of the information 
at hand, it is not a good one, In order to exhibit the best estimates it becomes 
necessary to study the symmetric ones. Recall that a function f = fixy, ■ ■ • , 
x„) IS symmetric if it is invariant under all permutations of its arguments. The 
proof of the main theorem of this section, the theorem of uniqueness for sym- 
metric unbiased estimates, is based on two lemmas. 
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Lemma 1 . If Q = Q{pi, pn) is a homogeneous polynomial of degree > 
Oinn real variables, such that whenever 0 < p, < 1 , t = 1 , • • ■ , n, and pi + • • ■ 
+ Pn = 1 then Q (pi, ■ • ■ , p„) = 0, then Q must be identically zero, 

Phoof. (Induction on n.) For n = 1 the lemma is triviaL Asaainr- then‘« 
fore that n > 1 and that the lemma is true for n — 1 . Observe that the hypoth- 
esis IS equivalent to the vanishing of Q for all systems of non negative arguments 
(without the restriction pi + ••■ + ?»■=!), since any such system {p,} can be 
replaced by {p./(pi +••• + ?«)). If in Q the variables pi, • > ■ , pn-i arc given 
any non negative values, then the hypothesis implies that the resulting poly¬ 
nomial in Pn vanishes for all non negative values of p„, and therefore identically. 
Consequently the coeflBcients of the powers of p„ in Q, which arc thcunwlves 
homogeneous polynomials in pi, • ■ • , p„_i, vanish for non negative arguments; 
and therefore (by the induction hypothesis) identically.* 

Lemma 2. If 3) is a set of dislrihutions finitely closed over a Borel set. E of the 
real line and if the symmetric function /(xi, • • • , x,) is such that for every dii~ 
iribution P in 3) the LebesgueStieltfes integral 

/ • • ■ / f(xi , • • • , Xn) dP(xi) ... dP(x„) 


existsandhaa thevaluezero, ihenf{xi ,•••,*„) = 0 whenfilierx, t £, i » 1 , ■ * • , n. 

Proof. Consider any point (xj, ■ • • , x\) with x*, e JB, -i - 1, ■.n, and any 
distribution P (in S)*(B)) which assigns the probability 1 to the sulmt jx? , ■ • ■ 
x„) of il. If the probability of Xj is p(, t = 1, • • • , n, then the integral 

/ • “ //(*i . • • • , X,) dP(xi) •. • dP{Xn) 


is a homogeneous polynomial (of degree n) in the n variables Pi . The 

hypo^e^s of Lemma 1 are satisfied-it follows that this polynomial vanbhea 
identically. The symmetry of / implies that the coefficient of the term p, ■ ■ - p, 
IS exactly n /(xi, ■ • ■ , x„), thereby establishing the conclusion of the lemma 

(P - (o(xi, - • IS any function of k real variables and if a is a positive 
mteger, n ^ k, it is convenient to write 


= v‘"'(Xi , • ■ • , x„) 

for the average of the values of ,p over all points obtained from (xi, 

extractmg ordered subsets of fc x’s. Thus, for instance, 

(xix,)"" = ^ (xjjjj -p j-ix, + a;ja;,) 
and 


, »n) by 


— S (xi + ■ ■ • -p xf). 


I am ladshted to J B Bobhc]* atiH n t WbH * ai * 

Lemma 1 was more complicated. ' ^ original proof of 
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Theorem 3. Let 3) be a set of distributions fimtely closed over a Borel set E of 
the real line and let F he a homogeneous function of degree k, 

F(P) = I j <pixi, ••• ,Xh) dPixi) ■ • • dP(x,,) 

over If f{xi Xn) is a symmetric unbiased estimate of F over of order 
n > k, then for every point (xi, • ■ • , x„) with e E, i = 1, ■ ■ ■ , n, f(xi , • • • , 
Xn) is equal to the symmetrized function ¥>'"'(«!, • • • , Xn). 

Proof. Observe first that 

f ■■■ f vixi , ■■ ,xii) dPCxi) ■ • ■ dP(xfc) 

remains invariant if (xx, • • • , x*) is replaced by (x,j , ■ • ■ , x,^), where {I'l, • • • , 
ih] is any subset of (1, ■ ■ ■ , n}, since the change is merely a matter of notation. 
It follows that 

F(P) = j j ^(xi , • • • , Xi) dP{xi) • ■ • dPixk) 

= ,■■■ ,X„) dP(xl) • • • dP(Xn), 

so that v’'"’ is indeed an unbiased estimate of F. Since v)'"* is also symmetric, 
/ — satisfies the hypotheses of Lemma 2 , and the desired conclusion follows 
from an application of that lemma. 

6 . Characterization. For any Borel set E on the real line let 3)*(E) be the 
set of all those distributions which assign the probability 0 to the complement 
of E. Thus, clearly, 3lik{E) C 3}*{E); if F? is the entire real line then 2)*(P) = 
3)*; if S consists of a finite number of points then 3)+(P) = 3l*{E). 

Theorem 4. Let 3) be a set of dislnbuiions finitely closed over a Borel set E 
of the real line and contained in 3)*(E), and let F be a homogeneous function of 
degree k, 


F{P) = J ■■■ J v^Cxi, •.. , X*) dP(xi) • • • dP(xk) 

over 3). A necessary and sufficient condition that the function f = f{xi , ■ • ■ , x„) 
be an unbiased estimate of F over 3), of order n > k, is that the Lebe'sgue-Stieltjea 
integral 


f "" f , • • • , x„) dP{Xi) ■ ■ ■ dP{Xn) 

exist for every P in 3) and that for every point (xi, ■ • • , Xn) with Xi e E, i = 1, • ■ ■ , 
n, the symmetrized function /^”'(xi, • • • , x„) he equal to (p‘'^'‘{xi , • • ■ , x„). 
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Peoof. If / is an unbiased estimate then is a hymmetnc unhitL^etl esti¬ 
mate and therefore, by Theorem 3, equal to p'"'; the converM; follows from the 
facts that 

I ■■■ I /(^i 1 ■ • • I ^") ‘ • • dPis„) 

= J .. . J , ■■■ ,X^) dP{Xi) • • • >iP{x„) 

and that (as a consequence of the hypothesis U) £ the etpiality of /' ”* and 

for points whose coordinates are in E implies the ecpiaHty of their int^'grals. 

Theorem 4 exhibits all possibilities for unbiased estimates (over domains satis¬ 
fying the hypotheses). Given a point (aii, • • • , x„), suppose that the miinlH'r of 
different points obtained from it by permutations of tlie coordinates is jV. (If 
the xi are all different then N = n\). An unbiased estimate is obtamwi if / is 
defined arbitrarily over iV — 1 of these points and if its value on the jVth point is 
chosen so that the identity is satisfied. As long jw the arhitrarj' 

choices at the (possibly) uncountably infinite point groups are not too wild tuul 
not too large (i.e, are such that the resulting function/ is measurable and iuf^gm- 
ble), / wiU indeed be an unbiased estimate. Typical nonpnthological examples 
of UEsymmetric unbiased estimates are weighted averages of the permuted valiu’s 
of itf(a;i , • •• , Xh), similar to the unweighted average ¥>'"*( 3:11 ' • * 1 •£■«)■ 

6. Uniqueness. The assumption of symmetry is a rather natunil one to miuiru 
of an estimate; it amounts to requiring that the estimated value should lx* 
independent of the order in which the observations arc made. Theorems; 3 
and 4 establish that the concept of symmetry is inherently assoeiaUHl with tin- 
biased estimation and that, under this assumption, there is a unique unhuwed 
estimate (whenever there is one at all). These theorems, therefore, constitute 
a partial answer to the uniqueness problem (III); symmetry, after all, W a pos^ii- 
ble interpretation of "good” estimate. From another point of view the answer 
to the problem of “best” estimate is contained in the following theorem, 

Theorem 5. Under the hypotheses of Theorem 4, among all unbiased tslimaten 

of 

P{P) - I j <p{^i, ,Xk) dP{xi) .. • dPixi) 

the symmetric one, ie‘"'(a:i, • • ‘, x„) is the one wilh least vaiiancd or, equwalenlly, 
the least second moment 

j ' ' f ((^‘"’(21, • • ■ , x,))" dP(xi) ... dPM. 

Proof. Observe first that if Z:. ■ • ■ , are independent random variables 
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with the same distribution P then, if / is an unbiased estimate of FiP), the 
variance of/(Zi, • ■ ■ , Z„) is given by 

£{/(Za,...,Z.)}*-B*{/(Zi,-..,Z„)}. 

Since the second term is the same for all/, namely F^iP), mmimizing the variance 
is indeed equivalent to minimizing 

E{f{X^ Z„)}' = /■•■/ {/(Xi ,-• ••, X„))^ dP(Xi) ■ ■ dP(Xn). 


This quantity need not be finite even for/"s and P’s for which E {f(Xi, • ■ ■ , XJ ) 
exists. It will be shown, however, to be minimized by rp'"’ in the sense that 

P{v.‘"'(Zi, • • • , ZJp < P{/(Zi, ■ ■ • . Z„))^ 

for all unbiased estimates / and all P, and that the inequality actually holds for 
some P. 

For the proof consider any unbiased estimate / of F. For any given point 
(xi, ■ • • , x„) suppose that N is the number of different points obtained from it 
by permutations of the arguments, and denote by , f = 1, • • ■ , Z, the values 
of / at these points Smce, accordmg to Theorem 4, = ip'"’, it follows that 

»(^Lf-i/.J < ^Er-i/? = (fy”\ 

Hence 

I f ,x„)j‘dP(xt) .••dP(x„) 


- / ■■■dP(x„) 

= / • • • j fi^i , ■ • ■ , n:„) dP{xi) ■ ■ • dP(x„). 

This already establishes the minimal property of (p'"' in the weak sense. 

If the mequality were an equality for all P for which the terms are defined then, 
by Lemma 2 , it would follow that 

{/V, ••• ,5:.)}''’' 

for all (xi, • • • , x„). Hence the Schwarz inequality, as applied above to the 
sum ^ £ 7-1 /i, reduces to an equality; this can happen if and only if (/i, • • ■ Jn) 

is proportional to , • ■ ■ , , i-o. if and only if all /, are equal to each other. 

The validity of this statement for every point is equivalent to the symmetry of / 
and hence, by Theorem 3, to the statement / = This concludes the proof 
of Theorem 5. 
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7. Concluding remarks. (1) The most obvious estimates of the momenta, 
Fm{P), of a distribution about the origin are the sample moments 



Their use is justified by the uniqueness theorems (3, 4, and 5) of this paper. 
Similarly one might think that the natural estimates of the moments, Pi»(P), 
about the expected value Fi(P)-, are best estimated by the sample momenta 

gmixi {x, ~ 

n 


about the sample mean x = - *{. Denote by/„(xi, • • • ,x„) the estimate 

of K(P) obtained by expanding P„(P) in terms of the F,(P), as in the proof of 
the corolJaiy to Theorem 2 , and then estimating each term by the symmetric 
estimate considered in Theorems 3 and 4. Then an easy calculation shovm that 


f2(xi , ■ • • , X,) = —^ . , ar,) 

and 


f>(xr ^ ^ 

(These functions are the classical estimates of Pj and F ,.) For m> 3,/„ can 
still be exprewed in terms of g% but no longer as a constant multiple of It 
ap^ars that in general /„ is a linear combination of p,, ■ •. , with coefficients 
■which are rational numbers -whose denominators are (n - 1 )(« - 2 ) > ■ • (n - 

^ another aspect of the non existence of unbiased estimates 
of order n for P™ when m > n. 


(2) For any Borel set E on the real line denote by the probability, P(P). 

assigned by P to E. If <p,(x) is the characteristic function of the set JS, the 
representation ’ 


Fm{P) = j <pg(,x)dP{x) 

shows that homogeneous of degree 1 , and therefore possesses unbiased 
estates of dl orders The symmetric unbiased estimate of iTin 

perfect accordance with intuitive demands, by the funCtion/.(x.. ” ^x^)2Z 

value IB - times the number of those coordinates which belong to E, 

“ estimating such “non linear” functions as rPCPll* « 
mew a paradoxical. In the first place it appears strange that there ahoidd be 
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essentially different processes for estimating the expected value and the square of 
the expected value. (Recall that since 

iFi(P)f = j j xiXidP(xi)dPiXi), 

the symmetric unbiased estimate of (FiiP))"^, of order ri, is '.) Consider, 

for instance, the distribution P which assigns probability ^ to each of the points 
+ 1 and —1. The symmetric unbiased estimate of order 2 for Fi{P) is + 
Sj), and for (Fi(R))^ it is XiXi. Hence in the four possible cases 

( 1 , 1 ), ( 1 , - 1 ), (- 1 , 1 ), (- 1 , - 1 ) 

the biased, incorrect estimate + 3:0}'* for (Fi(P))’ yields 

1 , 0 , 0 , 1 . 

whereas the unbiased, correct estimate yields 

1 , - 1 , - 1 , 1 . 

The actual value of (Pi(P))’ is, of course, 0. Hence it is true in this case that 
whenever the biased estimate is in error, the unbiased one errs by the same 
amount. To add insult to injury, the unbiased procedure even yields negative 
estimates for the essentially non negative quantity (^'l(P))^ These considera¬ 
tions seem to indicate the necessity for caution m using unbiased estimates of 
“non linear” quantities, such for instance as Pm{P). 



SOME SIGNIFICANCE TESTS BASED ON OHDER STATISTICS 
Bt John E. Wax-sh 


PriTiceton Vniveraity 

1. Summary, In this paper significance testa are developed whose application 
requires only the determination of one order statistic and the computation of 
sums of sample values. The simplest ewe considered is that of testing a new 
sample value a; on the basis of m previous sample values yi, • • • , , all sample 

values being assumed from normal populations with the same variance. Tw’o 
separate tests of whether the mean of the new population from wliich x waa taken 
exceeds the mean of the population from which > j/« were drawn consiat in 
accepting the alternative that the new population mean exceeds the old popula¬ 
tion mean if 


( 1 ) 


( 2 ) 


£ i'. + Vm+'l , 


where i/(u) is the uth largest of yi, • • •, . It can be shown that both of these 

tests have the same power so that either one might be equally well selected for 
use. In practical application, however, there may exist reasons for pnifcrritie 
one test to the other. Similarly, the alternative that the new population nican 
is less than the old population mean will be accepted if 

/ v^^r+T + 1\ A ,- 

\ m Vm-i-l i/twf 


(3) 


X < 


(4) 


m 

X < 

V m 


1 ^\ " 

~ j S + Vm + 1 t/w V 


' /I 

M four of these significance testa have the same power, also the same significaneo 
level «(«, m). By appropriate choice of n and m th^ signified le^TcT^ 
made to assume values suitable for significance teste. FmTxample, ^ 

«(1> 6) =» .0166, a{2,10 ) = .0107 

«(3,13) ^ .0110, a(4,16) « qiq^^ 

The above tests are still valid if each otx^y,, equals a sum of r sample 

samprvSsVjT!'-, ® T 

sum of relatively weighted past sample vSul^ntfiS h 

statistic. The introduction of ffiio t.i 1 +■ i ^ utilized but not as ati order 

past information to be lumped toeetheVIoff ^liable 

importance. ^ according to its relative 
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In comparing the order statistic tests with the most powerful tests which could 
be used for these alternatives it is found that the size of the samples used must he 
increased in order to bring the eflSciency of the order statistic test up to that of 
the corresponding most powerful test. Thus the advisability of using the order 
statistic test will depend upon whether it is more desirable to take larger samples 
but have less computation, 

2. Introduction. Many statistical problems are concerned with the determi¬ 
nation of whether a new sample can be considered as having been drawn from the 
same population as that from which a previous sample was taken. Frequently 
this reduces to the question of whether the mean of the population from which 
the new sample came is greater than the mean of the past sample population. 
The problem of whether the new population mean is less than that of the old 
population is also occasionally investigated. If both populations can be con¬ 
sidered noimal with the same variance, it is well known that the most powerful 
Studentized test of each of these one-sided alternatives is furnished by use of the 
appropriate Student i-test. When the number of previous sample values from 
which the test is determined is large, however, the computation of the numerical 
value required for the application of the Student t-test becomes lengthy. This 
calculation difiSculty can become very important if the test is to be applied 
repeatedly as, for example, in quality control work. It is desirable, therefore, 
to develop other Studentized tests which are easily calculated and whose efficiency 
with relation to the corresponding Student f-tests is reasonably high. It is the 
purpose of this paper to develop tests of this type by the use of order statistics. 

The class of tests in which a new sample value x is tested on the basis of m 
previous sample values , • • • , 2/m used as order statistics is developed in detail. 
The significance tests arising are the ones given in the summary above. For a 
better intuitive understanding of what takes place rewrite (1) to (4) as 

(10 X — y > Vm + l(y — Vw) 

(20 X y > ■\/m + liV + 2/(m+i-u)) 

(30 X — y < Vm + l(y — 2/cm+i-u)) 

(4:0 a: + g < Vwi + l(y + Vw), 

where y is the average of the j/,-. The relative efficiencies of these tests with 
respect to the corresponding Student t-tests are determined and the simplicity 
of the computation necessary for their application is outlined. The method of 
attack having been sufficiently indicated by the development of this special 
class of tests, more general tests based on order statistics are stated but not proved 
here. 

3. Statement of the significance tests. Let each of x, yi, , j/mbe 
distributed independently of all the others, x according to N (r, o-^) and the yi , 
(i = 1, • • • , m), according to N{ii, /), where the notation ^(J, o-*) signifies the 
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nonaal distribution with mean f and variance ir*. As above let yi^) denote the 
■uth largest of 2 / 1 , ■ ■ ■ ,ym • The one-sided significance teste are then stated as 
follows: 

If 


(5) 


* > ^ 2/- - ^ Vw > f)) 

Aa 1 Aj 

^ Vx — ^ VCm+l-u) (Kl < 0) 

A 2 1 A^ 


accept the alternative y < v, otherwise accept the hypothesis tested, namely 
that y = v. 

If 


( 6 ) 


X < 


X < 


K, 

K, 



Ki 

u) 

Kx 


iKi > 0 ) 
{Kt < 0 ) 


accept V < y, otherwise accept v = y. 

The constants Ki and ifj are given by 

(7) ifi = ffi + 1 ± Vwi -f- 1, ifj— —1=F -^/fn Ij. 

where all upper signs or all lower signs will be chosen so that to a given value of A'l 
there is but one value of K 2 This rule for the choice of signs will hold through¬ 
out the paper. 

It is to be noted that (6) defines two separate significance tests of the hypotheaia 
y = V against the alternative y < v depending upon whether it is decided to use 
the positive or the negative value given for Ki. A similar statement applies 
to the two significance tests defined by (6). 

Each of these four significance tests can be shown to have the same significance 
level, which is determined by the values of u and m. Denote this significance 
level by ai(u, m ). Then it can be demonstrated that 


a(l, m) = (1)" 


a(2, m) = (jn+ 


a{3, m.) - (w* + m + 2)(J)'"+\ a(i, m) = + 6m + 

The general expression for a(u, m) is given by (12). 

It is to be observed that the application of these tests is independent of the 
parameters of the normal populations in question. 


“““ and 
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Now consider this analysis. Let 

, X — V 


y< - 

(T 


(« = 1, • • •, ffi). 


Then x' and the y[ are independently distributed according to N(0, 1). Define 

U = (^Kiyl - £ 2^1 + Kix'^, (m = 1, • • •, m), 


It is easily seen that 


E(tu) = 0, E{rl) =^^{Kl + Kl - 2K, + m), 


E(rur,) = ^2 (Kl - 2Ki + m), 


(m v). 


Thus the condition which must be fulfilled in order that the Tu be independently 
distributed according to N(0, 1) is that 

( 8 ) K\-2Kt+m=^ 0 . 

To insure that the r „ are independent of n when /i = v it is evidently necessaiy 
that 

(9) Kl — TO ri" Kt = 0. 

Solving (8) and (9) for Ki and Jfj one obtains (7). 

Restrict the by conditions (8) and (9) and let rju) be the wth largest of 
ri, ••• , r„ . From (8) Ri > 0; therefore 

n-) = ^- L + KiX^, 

where y\u) is the wth largest oiy'i, • • •, y™ . Then using (9), 


r(u) 


1 r 

= Kl 2 /(u) — ^ y, + 1^2 K -|- Kz{n — v) , 

AicL 1 


From the definition of the power function and (5) for Ki > 0, it follows that 
the power function for this test is given by 


" 1 if ~ 

'Power Function - Pr x > - —yiui 

As I As . 


(10) 


= Pr 0 < Kiyw ~ X) + Kix < «= 
- 1 


Pr 


r 


{n — v)< — '^yx-\-KiX-{- Kiijn--v)\< » 


= in - v) < r(u) < 
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The distribution function of the order statistic may i>e found in [1], from 
which it follows that 
_ _ 

Power Function = , _ 

( 11 ) 


poi / pt \u““l / Vm-u 

/ ( f(y)ciy} ( f(y)dy) fiz) (h. 


where 




Consider the value of the power function under the assumption that the hypothe¬ 
sis IS true. Then fi = v and from (11) the significance level of the test is given by 

?Tl ^ 

(„z I 

(12) ^ ' 

1 *^ / A* \.U*-1 / /»eo VfO b 

• I dyj f(z) dz. 

The method used to elimmate a from the quantities required for the applieatiuri 
of the significance test, therefore, is to have the limits 0 and « in the probability 
expression (10) for the power function when the hypothesis is tme. Suitable 
significance levels are obtained by varying the statistical function rtw by means 
of the selection of the values of u and m. 


6. Comparison with Student t-test. The te.st consideicd is that of u single 
sample value on the basis of m other sample values. Iloncc, the corresponding 
Student 1-test has m - 1 degrees of freedom. The probabilities of Type 11 
errors for the Student 1-tests are calculated for values of 


5 = 


1 + ^ 
m 


by use of the normal approximation given in [2]. 
Using this notation 






1 + i 

m 


and from (11) the power function for the significance test for which the altematiiN 
IS M < V and K, > 0 is found to be 


w! 


(m 1)! (m u) 1 is(Xj/Xi)vT+(I715) 
The probability of a Type II error for 


{Lj(y)dyy \[mdyy'' 


M tiz. 


01 a rype ii error for a given value of 5 is equal to one minus 
the value of the power function for this value of S. 
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It can be proved that the other three significance tests have the same proba¬ 
bilities of Type II errors as the one considered above. 

The numerical comparison of the two types of tests is contained in Table I. 
In each instance the significance level was chosen to be approximately 01. 

The process of increasing the size of each sample by a given percentage has 
practical meaning if feach of x, i/i, • • • , i/^ equals the sum of r sample values 
For example, if a:, yi, ■ • • , j/m each consist of the sum of ten sample values, 
increasing the sample size by 30% would amount to letting x, yi, • • • , each 
equal the sum of thirteen sample values. The case where each ofx,yi, • ,ym 


TABLE I 


Test 

Degrees of 
Freedom 

m 

% Increase 
in Sample 
Size 

Signifi¬ 

cance 

Level 

Probability of Type II Error 


B 

B 

B 

t 

5 


0 


.919 

.750 

.477 

.215 

o.s. 


6 

0 


.919 

.752 

.506 

.276 

o.s. 


6 

5 

.0156 

.916 

.742 

.486 

.256 

o.s. 


6 

10 

.0156 

914 

.732 

.469 

.239 

t 

9 


0 

,0107 

.930 

.735 

.413 

.142 

o.s. 


10 

0 

.0107 

.936 

.782 

.527 

.270 

o.s. 


10 

20 


.927 

.738 

.448 

.191 

o.s. 


10 

30 

.0107 

.921 

.715 

.411 

.161 

t 

12 


0 

.0110 

.920 

.699 

.358 

.106 

o.s. 


13 

0 

.0110 

.933 

.771 

.492 

.245 

o.s. 


13 

30 

.0110 

.919 

717 

.378 

.139 

o.s. 


13 

40 

.0110 

.913 

.679 

.353 

.119 

t 

15 


0 

.0107 

.919 

.688 

337 

.092 

OS. 


16 

0 

.0107 

.938 

.765 

.488 

.234 

OS. 


16 

40 

.0107 

.917 

.687 

.351 

.111 

O.s. 


16 

50 

.0107 

.912 

.664 

.310 

.090 


equals the sum of r sample values will be treated later and will be shown to be 
a particular case of the one analyzed above. 

In Table I the order statistic tests (O.S.) are calculated for cases where the 
size of each sample is increased by the same percentage. This amounts to saying 
that the amount of information used for the test has been increased by this 
percentage. This method furnishes a quantitative estimate of the relative 
efficiency of the order statistic te.st as compared with the corresponding Student 
i-test. For example, if 30% more information is required for the order statistic 
test to have the same probabilities of Type II errors as the corresponding Student 
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t-test, then the order statistio test will be said to have a relative efficiency of 



Examination of Table I shows that the order statistic tests have the approxi¬ 
mate relative efficiencies listed in Table II. These relative efficiencies can 
shown to be approximately the same as those for other significanctf levels. 


6. Computation required. Since application of the order statistic test requires 
only the determination of one order statistic, the calculation of one sum, the 
multiplication of each of these quantities by given constants and the subtraction 
of the resulting values, the amount of computation required for application of the 
order statistic test is obviously much less than is necessary for the application of 
the corresponding Student t-test. 

If the test is applied contmuously from one sample to the next, as in quality 
control work, the value of ^ yt can be calculated by a continuous process. Por 
let the sample values be taken in the order yi, " ■ , Vm, x, where x is the new 


TABLE II 


m 

Sigtdficaace Level 

% Increase in Sam¬ 
ple Sise 

Relative Efficiency 

6 

.0166 

5 

96% 

10 

.0107 

25 

80% 

13 

.0110 

36 

74% 

16 

.0107 

43 

70% 


sample value which is to be tested on the basis of the previous m sample values 
2 / 1 , ■ • •, . Then x for the present test becomes y„ for the next test; be¬ 
comes • ; 2 /, becomes yi, and yi for the present test is no longer u»d. 

The value of i will be furnished by the next sample value drawn, Thus, ^ yi 


for the next test is calculated by adding x - y, for the present test to T y, for 

V,-?® determined from a pl^of tlm 

sample values which is also applied continuously from one sample to the next. 


7. Generalization of results. The derivations given above are immediat^lx, 
applicable to the case where x renresentH the rvL = i 
lation with distribution NW. 7) aTeai vTlz L t 

of r sampe values from a poililon wi^dittron ^ ’A Tt ^ 
b6 distributed aopoTflin+n ^ « / 2 v ^ x would 

toNiru' ra'^) ThpspHit ’K ^ be distributed according 

M = v/;: O and iV(M, whl® 
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If X equals the sum of r sample values from a population with distribution 
N(y, cr*) and each yi, (i = 1, ■ ■ ■ , m), equals the sum of s sample values from a 
population with distribution <t*), the significance tests are derived in a 
similar manner and can still be stated in the forms (5) and (6), but the values of 
Ki and Ki become 


Ki 




The power function for the test in which the alternative is < v and K 2 > 0 


(fi — v) in (11). The significance 


is found by replacing (m — v) by 

ivio' Kiir 

level of each of the four tests is again furnished by (12) and it can be shown that 
each test has the same power. 

To this point aU significance teats considered have consisted of testing a new 
sample on the basis of m previous samples used as order statistics. In some 
cases, however, it may be desirable to utilize additional samples in the test but 
not as order statistics. These sample values can be gathered together in a 
summation term in which values from different samples are given relative 
numerical weighting. This procedure can be used to emphasize those sample 
values which appear to be more important from practical consideration with 
relation to those which seem to have less importance. The determination of 
what relative weighting scheme to use is to be decided by the person applsdng 
the test and is not considered as a problem of this paper. The significance 
tests with this property can be stated as follows: 

Let each of Xa, yib, Zjc, (a = 1, • • • , r; 6 = 1, • • • , a; c = 1, f = 

1, • • • , m-,j = 1, • • • , n), be distributed independently of all the others, the Xa 
according to N(y, <r’) and the yn, and 2y« according to N(y, a). Define — 

S 

2 j ■ ■ ■ I ^)> J/(“) wth largest of yi, 


1-1 

one-sided significance tests are then given by 
If 

'txa> 

I A2 

1 Its 

accept the alternative < v, otherwise accept n = > 
If 

1 n-2 


Vm • The 


2 


^ - y m+l-u 

A* 


(Ks > 0) 


{K 2 < 0) 


(X» > 0) 


{K 2 < 0), 


accept IX > V, otherwise accept n = v. 
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The quantity Vu is given by 



where the constants G,, O' = 1. ’' ’. defined by Cj = Wjit, the Wf being 
given positive weights. The values of ij, Ki and are 


- iA 

_ B y s 


K, 




m + A^jB 


rr ( \ AA . ^ /t r^\ 

^‘“^T3v£r + i + y;^V’ 


B(m + iVS)’ + ^’(5) 






where 


I ^ 




The quantity ri in the expressions for the Cj is not considered given but is clo- 
termined in the derivation of the tests. The two equations corresponding to 
(8) and (9) then contain three undetermined quantities i}, Ki and Kt. Thus 
there are infinitely many possible selections of these quantities, each selection 
resulting in a valid significance test. The values of n, Ki and K% given above, 
however, are the ones which result in the maximum power fimction ami conse¬ 
quently the smallest probabilities of Type II errors. The power function for the 

K 

test in which the alternative is g < p and IT* > 0 is that given in (11) with 

K\ cr 


• (/I - p) replaced by 


KiVr 

Xi<r 


(^i - p). 


The significance level of each of the four 


tests given above is still that of (12). It can also be shown that each of the tests 
has the same probabilities of Type II errors. 
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CHAINS OF RARE EVENTS 

By Feldc Cernuschi^ and Louis Castaqnetto 
Harvard University 

1. Summary. The negative binomial distribution of Greenwood and Yule is 

generalized and modified m order to obtain distribution curves which could be 
used in many concrete cases of chains of rare events. Assuming that the num¬ 
bers of single, double, triple, and so on, events are distributed according to Pois¬ 
son’s law with parameters Xi, , Xs ■ - • respectively, and that X, is given by 

X, = Xi -jp, the probability of obtaining M successful events is studied. In the 

considered relation X,, for convenient values of a, first increases with s and after 
a certain saturation value of & starts to decrease. A relation of this type is very 
suitable for studying the distribution of score in a match between two first class 
billiard players, the probability of accidents on a highway of dense traffic, etc. 
The general methods of findmg the distribution curves for arbitrary relations 
between the X’s are indicated. The method of steepest descent is applied to find 
an acceptable approximation of the distribution function; and the advantage of 
this method'is pointed out for other similar cases, in addition to the concrete one 
which was developed, in which the method of direct expansion into power series 
becomes inapplicable. 

2, Introduction. M. Greenwood and G. U. Yule [1] have deduced the nega¬ 
tive binomial distribution from a compound Poisson law: 

where X itself is a random variable distributed according to Pearson’s law of 
type III: 

P(X)dX = 6“'’"dX. 

a! 

They obtained the distribution 
P{m) = 

* a! ml 

S 

where 1 — a = - easily seen, F(m) is given by the coefficient 

p + 1 

of x” in the expansion of; 
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R. Lttders [2] has arrived at a negative binomial law by the following oonaidera- 
tions. Certain events, like automobile accidents, can be classified as simple or 
multiple according to the number of units involved. Assume that the numl)ers of 
single, double, triple, and so on, events are distributed according to Poisson’s law 
with the parametera , respectively. The probability of obtaining 

ni single, 7h double, nj triple, ■ • • successful events is (assuming mutual inde¬ 
pendence) 


( 1 ) 


P{ni,ni,n3, ••• ;Xi, Xs,X», ••■) = 


nri njl 




The total number of successful events is 


(2) a = -f 2712 d- 3ni + “ • • + fwi + “ • - 

The probability of obtaining n successful events is given by the sum of all expres¬ 
sions (1) subject to the condition (2). This sum is given by the coefficient of 
s" in the expansion 

(3) fix) ~ 

Now if the parameters X, satisfy 


(4) 

one finds 
( 6 ) 
and 
( 6 ) 




Xj ^ Xi 


/(a;) 


\1 - ax/ 


7(7+ 


Taking -d equal to « d- 1 one gets Greenwood and Yule’s distribution in the 

fonn given above [3]. The negative binomial law has useful applications, for 
inst^ce m some cases of accidents of workers in factories. It is proved that 
with values of o near 1, the most probable value for n is n » 0 and the average 
value IS a finite number different from zero. Therefore the distribution will be 
in some way simUar to the distribution of the scores in a match between two first 
class brilmrd players whose most frequent scores are zero and their average may 
be, say, 50. In the case of the Poisson distribution the most frequent score and 
the average score should be nearly the same. The relation (4) does not provide 
description of many practical distributions. For instance m a 
match between two first class billiard players, the probability of making a second, 
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third, • • ■ , point will be considerably greater than the probability of making 
the first. With the relation (4) X. is a decreasing function of s, while we shall 
investigate cases in which X, first increases with s and after a certain value of s 
starts to decrease. As other examples of distributions of similar types we shall 
mention the following: On a highway with dense traffic at high speeds the prob¬ 
ability of only one car being involved in an accident may be smaller than the 
probability of having several cars involved. Something similar may be said for 
the cases of work accidents in factories where the work of one is interconnected 
with the work of others. In many cases of telephone calls (business transactions, 
organization of meetings, etc.) the sample Poisson law is not suitable to interpret 
the distribution of calls, since one call may increase the probability that the 
called person makes one or more calls. 

The purpose of this paper is to treat the problem when, instead of (4), we take 
other expressions which may in a better way describe some processes such as the 
ones which we have referred to. 


3. Modification and generalization of the scheme of Greenwood-Yule and 
Liiders. According to the relation (4) Xt is a decreasing function of s and the 
parameter a must be in the interval 0 < a < 1. Instead of (4) we shall use 

(7) X.-Xi^\ 

where a may have any positive value. In particular for o = 0 our case reduces 
to the Poisson case. 

From (7) it follows that 

X “ a+l 

and we see that X, increases with s for 1 < s < o and decreases for 8 -b 1 > o. 
Substituting from (7) in (3) we get 

(9) /(x) = 

As the probability of obtaining n successful events is given by the coefficient 
of x" in (9), we shall expand e°"^’ in power series (a, /3 being two arbitrary 
constants). We have 

(10) = i + f + 

nM n-1 fll m-1 fi%\ 


2w ”77 a = ynCaJe 

m«I Vfh\ 
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where [4] 

yi(«) = a 

Vtia) = ot + a 

(12) 2/a (a) = ct + Za + a 

/V A*0" { 

2/«(a) = — a . 

.-1 

Here we use the notation of differences of zero: A '0We have 


Now in our case 


whence 


L »=i ra! J 

e = e \ 1+ 2^ —x Zj « . 

L n»l I***! 21 J 

“ , /3 = a, 

a 

^ ^ A‘0" /XiV 

n\ il \ 0 / ' 

P(0) = 


(17) P(0) = for 

We have in particular 

P(l) =XjP(0) 

P(2) ~ ^ (^1 "h o^i)P(O) 

I’(3) ~ ^ (^1 + 3Xia + Xia^)P(0) 

P(4) == |j (^} + 6Xia + 7XiV + Xia')P(O) 

P(5) = - (Xj + lOxlfl + 25 x!o® + 15X?o’ + Xia‘)P(0) 

P(6) = (Xj + 15Xia + 65Xlo' + 90X?o’ + 31XiO* + Xia')PC0) 

^ == ^ + 2lX?a + I40x5a' + 350X^a’ + 30ix!a^ + 63X V 


n > 0. 
for n « 0, 


+ Xia')P(O) 


^(8) - g, (Xf + 28xla + 266XiV + 1050X?o’ + 1701Xia^ + 9CCX?o‘ 


+ 127Xfa‘ + Xja’)P(O) 
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P(9) = ^ + 36Xfa + 462 xia" + 2646XiV + 6951X^0 + 7770x!a^ 

+ 3025Xja‘ + 255X?o’ + Xia'')P(0) 
P(10) = (Xi“ + 45X;a + 750Xfa* + 5880Xla' + 28827X?a* + 42525xJa 

+ 34105XV + 933 OX 10 ’ + 511X?a* + Xia°)P(0). 

For Xi = a it follows that 

(19) P(0) = 6"“+* 

(20) Pin) = e-*“« ~ y„(l) 
tpin)=ee- [l + E 1>„(1)1. 

n-O L J 

Particular values of (20) are 

P(l) = aP(0) 

P(2) =« a*P(0) 

P(3) = ^P(O) 

P(4) = P(0) 

P(5) = ~ P(0) 


( 21 ) 


P(6) = 


203o' 

61 


P(0) 


F(7) = ^ P(0) 
P(8) = P(0) 


P(9) 


81 

21147a’’ 

91 


P(0) 


P(10) == — P(0). 


In Figure 1 we have graphed the curves P(n) for the values — = 1; Xi =? 0.1, 

d 

Xi = 1, Xi = 2. We see in particular, that for Xi = 1 we have P(0) = P(l) 
and for Xi = o = 1 we have P(0) = P(l) = P(2). 
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4. Application of the method of steepest descent. If X. is not given by (7) 
the above method of direct expansion of/(a:) into a power series, usually becomes 



inapplicable. In many cases it is possible to use instead the method of steepest 
descent [6] in order to obtain approximate values for the coefficients of x" in the 
relation (3). 
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As is well known, if fiz) is axi analytical function we have 


(22) coeff. of A - J; / d. 

f{z) 

where X -{-iY = log - 7 +i and the integral is taken along any closed path around 

jS* 

the origin. 

To evaluate the integral ( 22 ) we shall follow a method similar to the one used 
by R. H. Fowler [ 6 ]. Putting 2 = pe‘“ the relation (22) may be written: 

(23) Cde«.olz-^^[y^,da 

where the value of p is arbitrary. We shall put in particular p = Xo where Xo is 
the root of 


(24) 


For most functions which interest us 


XofiXt) 

/(Xo) 

/(x) 


n. 


» as X 0 and asx—yK (a. positive 


number which in some cases may be infinite) and the second derivative is always 
positive. Consequently/(x)/®" has only one minimum between 0 and K, and (24) 

/(xoe’“) 

has therefore only one root xo. Developing log -—f— into powers of a, (24) 


becomes 

(25) coeff. of x" 
where 


fjxo) 

2v Xo" 


c 


Xo e 




da, 


= log . 


2/0 

In the case where ip'''(xo) » 1 the first term in the exponent in (26) increases 

in absolute value very rapidly in the neighborhood of xo. For small values of a 
we may therefore in a first approximation drop all other terms. Also, as this 
first term tends rapidly towards zero one does not appreciably increase the error 
by replacing the integral from ~ ir to 4 - *• by the integral from — to + «. 
In such cases we have, therefore, the approximate formula 


(26) 


coeff. of z" 


/(xo) 
2 t 7 r x" 





/(xo) 

Xo'^^\/2t;p'*(xq) 


We are now in a position to deduce asymptotic values for the probabilities P(n) 
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which we have previously calculated directly. In fact, for f(z} defined by (9) 
we obtain from (26) for large n 


(27) 


xS-\/n{axa+ 1 ) ’ 


where xo is given by 


e®*" = ■ 

In particular for Xi = a and putting oxo = it follows that 

( X \» . P**'" 

1/0/ Vn(i/o + 1 ) 

Comparing the numerical values given by the relation (28) with the c.xact values 
we find that even forn = 4 and \i = 1 (28) gives an approximation with an error 
of about 5%. 

Formula^ (26) can also be used to evaluate the numbers y„(l) defined by (12) 
for a = 1. Relation (13) gives for ur = == 1 


izi n\ 


and therefore 


Coeff. of x" in expansion of e** = 


ey^H) 

n! 


Putting/( 2 ) = e' and using Stirling’s formula for n 1 we have from (26) 


2/>.(l) 


Vzo + i 


“ Applying this relation to/(a) = e‘ one obtains immediately Stirling's Formula- 

/ , , /(d , 

= log — = g ~ 71 log g 


v'{z) = 1 ~ 


*0 "=■ n, 


2_ 

n\ 


iff \ ^ 

e I" 1 

«/ V^’ 


AJao relation (26) is useful to hnd other symptotic expressions; c g. for f(z) » (pj + 9)" one 
obtains for n —* « the Ijaplaoe-Gauss formula. 
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fil 


where aco is given by 


e 


xe 

« 233 


n 

* 


For n = 4, xo ~ 1.202 and yi(l) 16.66. As the exact value of 2 / 4 ( 1 ) w 16 we 
obtain in this case an error of leas than 4%. 

Repeating the calculations for u = 6, a;* = 1,432, we find that 2 / 4 ( 1 ) is given 
with an error of less than 3%. 
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NOTES 


This section is devoted to brief research and expository articles, notes on 
methodology and other short items. 


A NOTE ON SOME SINGLE SAMPLING PLANS REQUIRING THE 
INSPECTION OF A SMALL NUMBER OF ITEMS 


By J. H. Curtibb 
Cornell Universiiy'^ 


In the practical application of sampling inspection plans it ia often necessary 
to restrict the number of items (pieces, samples) inspected from each inspection 
lot to a relatively small number. For example, if many vendors are supplying 
a manufacturer with small lots of various kinds of material, the manufacfcunm 
will usually wish to have some check on his suppliers; however, he cannot afford 
to inspect large numbers of items from each lot. If sampling plana requiring 
the inspection of a small number of items are used, it is advantageous Ur know 
the characteristics of such plans. The present note offers several single sampling 
plans with sample size n < 25, together with their operating characteristic 
curves (00 curves) and average outgoing quality curves (AOQ curves).’ 

Single sampling plans for large lots may be described by the numlwr n of items 
to be inspected, and the rejection number r. If r or more of the items inairccted 
fail to meet some predetermined standard the lot ia rejected; if less than r items 
fail to meet the standard the lot is accepted. 

The 00 curve (see Figures 1, lA, 3 and 6) shows the relationship between the 
probability of rejecting a lot and the true quality of the lot. TTie quality of the 
lot is often measured by the “percent defective" in the lot; i.e., the proportion of 
material which does not meet some predetermined standard. It should l>e noted 
that the definition of 00 curve given here is only one of several in common use. 
In particular, the vertical axis often gives the probability of "acceptance"; such 
a treatment would amount to an “invereion" of the curves given here. Another 


'The material m this note was originally prepared as an office memorandum for the use 
of engineenngtechnioal personnel in a Government Bureau. The author wishes to ewrZ 
IS appreciation to Mr. C. F, Mostellerfor extensive editorial work on the original memoran- 

isnot oustomarvt ciTr- "'r '‘"‘‘■y*® oinsl® oampling plans liwauw it 

anoeorrerdZi iespcclion (awnpt- 

ance or rejection.] is determined before all the items are inspected. In other kinds of 

sam^TureerS’r ^here curtailing is often used aftel the fi«t 

inteSjbthe T**'’"* However, if one is 

inspeotinghis own productmiirht Ha 'ucludmg detailing, as a manufaolurer 
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common form ■would have the “percentage of presented lots (of quality indicated 
on the horizontal axis) that ■will be rejected (accepted)” as its vertical scale. 



Fiaimn 1 


It has been assumed that the lots are so large that the samples can be regarded 
as being drawn from an infinite population, or to put it another way, that there 
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is BO error in treating the samples as if they had been randomly drawn "with 
replacement”. 



Fiqubb 1A 

Especial interest is often attached to the points where the curve cro.*iaeH the 
5% and the 90% probability levels. A rejection probability of 5% is frequently 
associated with a quality value that has been called the "acceptable quality level” 
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(abbreviated AQL), and in published sampling tables by Dodge and Romig/ a 
rejection probability of 90% is associated with a quality value which they call 
the “tolerance percent defective.'” 

The average outgoing quality curve (AOQ curve, see Figures 2, 4 and 6) of a 
sampling plan shows the relationship between the long run average quality of 
the outgoing product after sampling inspection and the quality of the product as 
submitted for inspection. The quality of the product in each case is usually 
measured by the “percent defective” m the product. 

SUPPLEMENT TO FIGURES 1 AND lA. 

Quality of Lot {measured in percent defective) corresponding to various probabilities 
of rejection, for sampling plans in which a lot is to he rejected if one or more 
defective items are found in a set of n random sample items 



Probability of Rejection 


.01 

05 

.25 

.60 

75 

.90 


percent 

percent 

percent 

percent 

percent 

percent 

1 

01.00 


25.00 

50.00 

msM 

90.00 

2 

00.60 

02.53 

13.40 

29.29 


68-38 

3 

00.34 


09.14 

20.63 

warn 

53.68 

4 

00.25 

01.28 

06.94 

15.91 

29.29 

43.77 

5 

00.20 

■ZEH 

05.59 

12.95 

24.21 

36.90 

6 

00.17 


04.68 

10 91 


31.87 

7 

00.14 


04.03 

09 43 

17.97 

28.03 

8 

00.12 


03.53 

08.30 

15.91 

25.01 

9 

00.11 


03.14 

07.41 

14.28 

22.67 

10 

00.10 


02.84 

06.70 

12 95 

20.57 

11 

00.09 

00 47 

02 58 

06.11 

11.84 

20.40 

12 

00.08 


02.37 

05.61 


17.46 

14 

00 07 

HEH 

02.03 

04.83 


15.17 

16 

00 06 

00.32 

01.78 

04 24 

08.30 

13,40 

20 

00.05 

00 26 

01.43 

03 41 

06.70 

10.88 


The average outgoing quality is dependent upon the treatment of rejected lots. 
If rejected lots are cast aside once and for all, and are never resubmitted with all 
deficiencies corrected, then the average quality of the outgoing product after 
the sampling inspection tends to be the same as the average quality of the product 
submitted for inspection (provided that the quality of individual lots does not 
fluctuate too wildly). The only direct effect that the sampling inspection has 
in this case is to reduce the amount of the product which is accepted. However, 

’ H. E. Dodge and H. G. Romig, Sampling Inspection Tables, Single and Double Sam¬ 
pling, John Wiley and Sons, Inc , New York, 1944 
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the situation is very different if a rejected lot is always resubmitted with all de¬ 
fective material removed or replaced with non-defective material. In this case, 



Fiquke 2 


tendTbe beTte^L^n iUBpection wil 

+L * T f average quality of the product submitted for inspec 

tm. In fant, .f th. »bnutM ,n.Uty fe very poor, thn avnmge outgoing quS 
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will theoretically tend to be very good, because so many of the lots are rejected 
and then detailed. 



Under the assumption that each rejected lot will be detailed and resubmitted 
with all deficiencies corrected, a typical average outgoing quality curve starts 
at the origin, rises rapidly to a maximum, and falls off more slowly. The maxi- 
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mum average outgoing quality is called the average outgoing quality limit 
(AOQL) of the plan. 



Fiouke 4 


The graphs give the operating characteristic curves and average outgoing 
quality curves of certain single sampling plans. It is assumed the samples are 
taken at random without replacements from a lot which contains at least 10 times 
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the specified number of samples. In the case of the average outgomg quality 
curves, it is further assumed that rejected lots are always detailed and resub¬ 



mitted with all the defective material replaced by non-defective material. An 
approximation has been made in the calculation of the AOQ curves which makes 
them upper bounds. If it is assumed that many lots of size N of exactly the 
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same quality of product p are being produced and that we are taking wimples of 
size 51 from thenii then it follows that -A.OQ ^ p Pa Cl n/iV }j whore Pa is the 
probability of accepting a lot. Theteim n/N has been omitted j therefore these 



AOQ cuiwes are too high, but are a good approximation provided only that the 
ratio of ^ple size to lot size is small. 'The condition mentioned earlier in this 
paragraph requires that n/N < 0.1. 


ON THE USE OF THE SAMPLE RANGE IN AN ANALOGUE 
OF STUDENT’S f-TEST 

By Joseph F. Daly 
Bureau of Shtpe, Navy Department 

Let ail, • • ■ , represent independent observations on a variate x which is 
normally distributed with mean p and variance a. Assuming no prior informa¬ 
tion about the value of either parameter, let J?o be the hypothesis that p. is equal 
to or less than a specified quantity pa. The classical test of this asymmetrical 
form of “Student’s” hypothesis [1] is based upon the statistic 

i - , 

the region of rejection being defined by the relation t > U . 

For certain applications of a routine nature, however, such as production line 
inspection, the usefulness of this test is rather seriously impaired by the arith¬ 
metical work involved in the computation of t. For this reason Dodge [2] and 
Knudsen [3] among others have proposed tests of Ho based on a statistic of the 
form 



w 


where w is the sample range. It is the object of this note to show how the 
probability distribution of G can be obtained with the aid of the distribution law 
of w tabulated by Pearson and Hartley [4], and to present some numerical results 
which mdicate that the power of the resulting test is the same for all practical 
purposes as that of “Student’s” i-test for sample sizes N < 10. 

The calculation of the percent points of the G distribution is greatly facilitated 
by the following result, which does not appear to be generally known: 

Lemma: If x and w represent respectively the average and the range of a sample 
of N independent observations on a normally distributed variate x, then x and it) are 
statistically independent. 

Peoof: No generality is lost by putting p = 0, v* = 1. The joint character¬ 
istic function of x and the W{N — 1) differences x/ — x* , 0 < fc), is then 

do 


where the summation runs from 1 to N on each index with the understanding that 
tik = 0 for j > k. The usual process of completing the square in the exponent 
then yields 


>f>it, ijk) 




(2ir) 


-(W) £ 




dxi ■ • • dxn. 
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r" f” - 

I e dx ~ I e 


this reduces to 


V’(^i 




which readily factors into 

^ -wm -Jifstij*-;*,)]’ 

Piit) ■ = e -e 

Hence the differences Xj — Xh are jointly independent of J; and since the range 
w is a Borel measurable function of these differences (i.e., w — max \x, -- Xk |) 
it follows that X and w are independently distributed. 

The foregoing lemma is in fact capable of further generalization as follows: 
Let g(xi, • • ■ , xy) be a function which, like the range, has the properly that 
g(xi -j- a, ■ • , Xf, + a) = g(,xi, ■ , xk). The characteristic funclimi of Jamlg 

can then be written in the form 

P(t,h) = - Pi(l)-Hl.h). 

J—ao 

Now if the second factor ^ is analytic in I, it must be a constant as far as varia¬ 
tion with t is concerned; for by putting t = iNa (a real) we have 

^(iNa,X) = 

•Loo 

•Loo 

= (2,r)-w*> £ dzi-‘-dz^ ^ p,(X). 

Therefore i//(t, X), being constant in t along the axis of imaginaries, must lx: fiw 
of i throughout the complex plane. The joint characteristic function of £ and 
g 18 thus equal to the product of their respective characteristic functions, so that 
the two variates are independently distributed. In particular this result shows 
that m the normal case each of the moments about the sample mean is distributed 
mdependently of x 

Returning now to the distribution of G, we see that for (it > 0 

n » > «•/ - a—^o7- > “/f) 

/•“ r‘/VNo, 

~ J I f{z)h{v))dw dz 

v'z^O •'ur-»0 

= jf mP{z/VNQt)dz 

where f {z) m the normal probability function for g = 0, cr“ = 1 and Pful is 
the value [4] of the probability that the range of a sample of ob^rTationB 
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will be less than u standard units. For selected values of N Table I gives the 
value Got such that 

Fw{(f — no)/w > 1 = Mfll = .05. 

TABLE I 


Upper 5% points for distribution of G 


N 

Gos 

3 

.88 

5 

.39 

7 

.26 

10 

.19 


These values were calculated by Simpson’s rule and checked by Weddle's rule. 

To evaluate the probability that G will exceed G, when p po we may write, 
following Johnson and Welch [5] 

X — pa _ -s/Njx — p)/<T + \/N(p — po)/<r _ z + a 
w y/Nw/a ‘ 

The required probability is then given by the integral 

Table II is a comparison of the probability that G will exceed Gao with the 
corresponding probability that “Student’s” t will exceed i os foi various values of 
{p — po)/<r, the case N — 3 being chosen because the non-central t distribution 
is formally integrable in this case. 

TABLE II 


Probability of rejection for G and for t, {N = 3) 


(m — Mo)/cr 

P{G > .88} 

P{t > 2.92} 

.00 

.060 


.50 

.151 

.151 

.75 

.229 

.230 

1.00 

.322 

.322 


Similarly for W = 10 it was found that when p — po = ,383o- (i.e., when a = 
1.21) the probability that G will exceed G os is .296; the corresponding probability 
for t is given by Neyman and Tokarska [1] as .30. 

Pending the construction of more adequate tables of the percent points of the 
G distribution, it seems worthy of note that for iV < 10 the values of G,os can 
be estimated quite accurately by multiplying the corresponding upper percent 
point io6 by the factor 
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. /2(a: - iV 

■y/NE{wl 

where EH is obtainable from Tippett’s table of the mean range (fl). Eatimatecl 
values of Gm for sample sizes from 3 to 10 are listed for convenience in Table III. 
The approximate values of (?,os proposed by Knudsen [3] W'ere calculated in 
essentially this fashion, using however the s quare root of the expected value of 
S(a; - a)’ instead of the expected value of \/S(a; ~ *)*, and employing percent 
points of the t distribution determined by the relation P{U1 > ** .05 

instead of P{t > t.osj = .06. Thus thou^ the agreement between the values 
listed in Table III and the corresponding computed values shown in Table I 
is extremely good, the discrepancy between these values and those ©ven by 
Knudsen is rather large. Any error committed by using Knudsen's table will, 

TABLE III 


Estimated upper 6% points for distribution of Q 


N 

G.w 

3 

.882 

4 

.620 

5 

.385 

6 

.309 

7 

.260 

8 

,227 

9 

.202 

10 

.183 


however, be on the conservative side, in the sense that the probability of un¬ 
justly rejectmg ffa will have somewhat less than half the value indicated in that 


(11 J. Neyman "Etrots of the eecoad kind in testing 'Sludenfs' hypoth- 

(1936),pp,318-320. 

' (1932)! iaBompling inspection,-' AmericanMaehiniH, Vol. 76 

[4] E. S Pearson and H D, Hartley. ‘*The probability integral of iU range in ntmolm of 

[6] N, L, Johnson and B, Jj, ^Vblcii ^^ADDlication nf Vol.32(1942),pp.30MlQ^ 
Vol 31 ilMIUMeS 



AN INEQU-ALITY FOR DEVIATIONS FROM MEDIANS 

Bt John W. Tuket 
Princeton University 

In a recent note in these Annals, Bimbaum and Zuckerman [1] proved that if: 

(1) Xi, Xi, • ‘ , X„ are independent random variables with the same 
distribution (ie., form a sample), 

(2) their common distribution is symmetric about zero, 

then 


P{\ Xi + Xi + ■ • • + X„ |) > (p{n) ■ E(1 Zi I), 


where 


<p(^2k 1 ) == (p{Qik -|- 2 ) = 


1.3-5-7 ■ 
I.2.4. 


■■■ (2k+ 1) 
6 . ■ • (2fc) 


It is the purpose of the present note to extend this to the following, more 
general, result; 

Theorem. If 


(i) Xi, Xi, • • ■ , X„ are independent random variables, 

(ii) the median of each Xi is zero, 


then 


E(\X, + Xi + ••• + X„|) + \Xi\ + ... + |Xn|) 

7lf 

It will be convenient to let = X(| X< |) and 

d^^T.d, = - E(\Xi\+ \Xi\+ + \X„\), 

Jh 

so that the desired inequality becomes 

^(1^1 + ^3 + ’••+ X„ ]) > ip{n)-i. 

Define et by 

e, = f xdF,{x) , 

Jo 

where F,(z) is the cumulative distribution function of . Since 
d, = E(\ X< 1 ) = — xdFt(x) + J xdF({x) , 
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it follows that 



Bf d(, 


The basic idea of the proof, which is common to both. noU*s, is to divide the 
n-dimensional apace of xi, Xi, into its 2" “octants,'' break up the 

expectation of | Xi + + • • • + X„ | into the corresponding parts, and apply 

elementary inequalities. Let Os be the octant in which a wt iS of variabloa 
are < 0. From (4), (5) and hypothesis (ii) it follows that 


2 


n-1 


■■■ x.I[dFiixi) = 
Os’’ 


- dt, 


if Xt > ^ 
ifXi < 0 


iit Oa , 
in Os. 


Hence 


2”~^ ['■ • [ 23 H dFfixf) = 23 “■ 23 di «« e — 53 rff, 

O/ ^ 

where c = S e;, and xhe second and third sums are over all dc for which x< 0 
in the chosen octant Os . The contribution of the octant Os to 7!?(| A") 4- AT? 4* 
•• Zn|)ia 

/■ • •/ I 23 X. I n dFjixj) (23 a:<) II dFjixj) 

Os Os 


^ I 

For each value of s, there will be (:) octants with a variables S 0, The eum 
of their contribution to £/(! Zi -|- X 2 + ■ • ■ X„ |) is 


where the ineqadity followB from 11 o. | > 1 2 „, |, and it i. noticed ttot each 
d, occurs in J j diflerent inner sums. Becnlling that Zdi = nS, this may 



1 6 — 1 . 


be written 
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Finally, 

F(1 - Zs + • •. + Z„ I) = i /, > r f'""') \e-sd\ 

«=o 1=0 \ s / 

> (^\ {1 e - sd I + I e - (n - s)d 11 

2 «<n \ S/ 

> 2 -(n-i, 2 (A(n-2s)d, 

where the last inequality follows from | o | i- | 6 | > & — a. To complete the 
proof, it is only necessary to evaluate the last sum One method of evaluation 
may be found in Birnbaum and Zuckerman’s note. 

If each Z, = ±1, each with probability one-half, then all of the inequalities 
of the proof become equalities. So that, m this case, 

.0(1 Xi -j- Xj • • • -|- X„ j) = ipin) • d. 

Since the limiting distribution in this case is a normal distribution with 
standard deviation n* and 0(| Xi -h X 2 • • • + X, |) = (2n/7r)\ it follows that 
this is the asymptotic value of ip{n). 

The inequality of the theorem is only efficient when the 0(1 X, 1) are of nearly 
the same size. In other cases it can often be usefully supplemented by the 
Lemma. If 

(i) Xi, X 2 , • • • , Xn are independent 

(ii) for each i, either X« has median zero, or the sum of the means of the other X, 
is zero (this is implied by either (a) the median of each Xi is zero, or (b) the mean 
of each X, is zero), then 

0(1 Xi Xj -f- • ■ • + Xn 1) ^ Max 0(1 Xi IL 
The lemma follows from the case where ti = 2, by applymg that case to 

F. = x,„, F 2 = E , 

where the maximum of 0(1 X, ]) is attamed for i = io 
The special case follows from the inequality 

I a:i + xj 1 > 1 a:i 1 -j- ij-sgn Xi, 

since this implies 

Ei\ Xi + X 2 1) > 0(1 X, 1) + 0 (X 2 ) •0(sgn XO = 0(Xi) 
using first d) ii'iid then (ii). 

In conclusion, it is interesting to note that the mean cannot replace the 
median in the hypothesis of the theorem. For let Xi, X 2 , X 3 be independent, 
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and take the values 1 (with probability 2/3) and -2 (with probability 1/3). 

^ Xa + Xa takes the values 3 (with probability 8/27), 0 (with probability 
12/27), -3 (with probability 6/27) and -6 (with probability 1/27). Hence 
E{\ Xi 1 ) = 4/5, and Ei\ Xi + Xt + X, |) = 48/27 - 16/9 « 4/3fi(i X. |), 
which 13 not > d/2E(\ X, |). 
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ON THE INDEPENDENCE OF THE EXTREMES IN A SAMPLE' 

By E. J. Gumbei, 

New School for Social Research 

In a previous article [1] the assumption was used that the mth observation in 
ascending order (from the bottom) and the mth observation in descending ortler 
(from the top) are independent variates, provided that the rank m is small com" 
pared to the sample size n. In the following it wll be sbovm that thifl assump¬ 
tion holds for the usual distributions. 

Let a: be a continuous, unlimited variate, let * (i) be the probability of a value 
equal to, or less than, s; let <is (x) be the density of probability, lianceforth called 
the initial distribution, The mth observation from the liottom is written „x 
and the fcth observation from the top is written x*, Thus, the bivariate dis. 
tribution tDn(m^, Xj.) of and Xk , is such that there are 7n — 1 oljserv'ations less 
than „a:, ^ - 1 observations greater than x* and n ~m - k observations between 
„x and Xk . 

For simplicity’s sake write 

4’(mx) = i{Xk) = . 

v(mX) p(x*) “ ifik . 

Then 

(1) Xk) = - „$)’‘-'"-V*(l - $*)*-*, 

where 


(10 


C = 


(m -- 1)1(4: — l)!(n _ jji 

In the expression (1) no assumption about dependence or independence of 
Xk 's/mp led except that these values are taken from the same population. 
The distnbution (1) is now modified by introducing three conditions. First, 

‘ Research done with the support of a grant from the American Phlloaophioal Society. 
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that the two variates are extreme, namely that the ranks m and k are of the same 
order of magnitude and small compared to the sample size n. 

(2) n > > m a>! k = 0(1). 

Furthermore it is assumed that the initial distribution vix) is, for small and for 
large values of the variate, subject to L’Hospital’s rules 


(3) 


u,„ ?:(£) =. ito 


Ilf—ao 


-oo $(*) 


lim 

SwOO 


<p(a:) 


— lim 

*-oo 


1 — 4>(x) ■ 


Finally it is assumed that is so large that the equality of the limits may be re¬ 
placed by the equality of the quotients. Then it is legitimate to write 

/ r 

(3') !251 = = y* 

„<p yt 1 — ' 


Clearly, the three conditions do not imply any assumption about dependence or 
independence of the two extremes. 

From (1) the moat probable mth value from the bottom, mU, and the most 
probable ifcth value from the top, Uk , are the solutions of 


m 


— 1 I my' 

-V— my H- 

7iW «y 


n — m — k 
4’* - m^> 


my. 


n — m — k , wk k — 1 
yt i-- 


y* 1 — 

These two equations may be written by virtue of (3') 


y* 


m n — m — k _ k 

m^ 1 — ^t ‘ 

Consequently the probabilities of the most probable mth and Ath values 
and Uk are 


(4) 4>(mW) = —; ^(ut) = 1 - - . 

n n 

The expansion of the probabilities and around the modes and Uk leads 
[2, 3] by virtue of (2), (3), (4), to 

(5) $* = !-- e-«. 

71 71 

where 

( 6 ) mV = ~y(mW)(mX - mU)‘, Vk = J <p(Uk)(Xk - Uk). 

m K 

Therefore, distributions, subject to L’Hospital’s rules (3), may be said to be of 
the exponential type. Since the derivatives m(p and (fiu are 
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(7) m¥> = Vk = 

where 

(7') mff = ^ ” I 

the product of the first two and the last two functions in formula (1) may be 
written as a product of two functions 

(8) v).(i ~ = (-«^ )(«* 

Clearly, each factor in (8) depends only on one variable. 

In the same way the function of and Xk m the middle of (1) can bo split up 
into a product of two independent functions, each depending only on one vari¬ 
ate. By virtue of (5) 

4k — = 1 — ~ (me”'’ + fcc""*) 

u 

and by virtue of (2) 

(9) (4fc — = eKp(—me’"'0 exp( —fcc"*'*), 

where 


exp(a:) = e*. 

From (2) the constant factor (!') may also bo split into a product 

( 10 ) -_ 

(m - 1)' (fc - 1) !(w — m — fc)! (m — 1)! {k — 1)! ’ 

Introducing (10), (9) and (8) mto (1), the bivariate distribution of the mth ex¬ 
treme value from the bottom and the fcth extreme value from the top is obtained 
as a product of two independent distributions 


2Ii) — ln/(m®) "/kC^lt) 

where 

(^2) mf(mx) == exp(m„!/ - me"*') 

and 

(^2') fkixk) = exp(-fcj/k - fee"*'*) 

are the distributions of the mth extreme values from the bottom, alone, and of 
the fcth extreme values from the top, alone. 
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In the sjjecial case m = k and for a symmetrical initial distribution mth mean 
zero, the following equations hold 

(13) ~ , rail “ " “ Wn ■ 

(130 „■!■ = 1 — = 1 — #m; m<P = <Pk = <Pm- 

and the bivariate distribution of the mth values from the bottom m®, and from 
the top x„ , is 

(14) ^n(m^> ^m) " m/(«^) ■/m(^fn)i 

where 

(140 m/(mX) = /ra(— Xra) 

is the expression used in the beginning of article [1] 

It follows from (11) that the mth observation in ascending order, and the fcth 
observation in descending order, may be dealt with as independent variates 
provided that n is large, the ranks m and k are small, and that the mitial con' 
tinuous unlimited distribution is of the exponential type as defined by equations 
(3). 
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A NOTE ON SAMPLING INSPECTION 

By Paul Peach and S B. Littauer 

North Carolma Slate College and Newark College of Engineering 

In designing an industrial sampling plan conformable to the Pearaon-Neyman 
approach, the operating characteristic is made to pass as nearly as possible 
through two predetermined points Wald [1] has used this method for setting Up 
sequential sampling plans. 

A similar type of single sampling plan can be designed by using tables of the 
incomplete Beta function. Unfortunately, tables of this function are not 
generally available, and the existing tables do not cover the range for large 
sample sizes. 

An approximate solution of the problem for single sampling can be based on the 
widely available tables of percentage pomts of the chi-squaie distribution. This 
is equivalent to assuming a Poisson distribution of defectives in the sample, 
utilizing the well known fact that for even degrees of freedom the chi-square 
distribution gives the summation of a Poisson series. 

We use the following well established notation: 
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n = sample size 

c = acceptance number 

Pi = acceptable fraction defective 

pi = objectionable fraction defective 

a = risk of rejecting a lot if p « Pi. 

/3 = risk of accepting a lot if p « p». 

There seems little to be gained by using a large assortment of ixwsible risk 
values, since the necessary adjustment to secure a desired effect can Im made 
on the p’s. We suggest the adoption of .06 as a standard value for Iwth a and fi. 
This convention conforms to much existing statistical practice, in particular to 
some existing inspection tables. 

We propose also the use of 

Ri - Pt/Pi , 

which we call the “operating ratio,” as a measure of the power of diKcrimination 
of an inspection scheme. Dodge and Eomig [2] used what is efwentialiy the 
reciprocal of Ro as a basis for the construction of sampling plans. Nnw, assume a 
binomial distribution of defectives m samples and a ^ries of single sampling 
plans with the same c but different n. Ae n increases, the effective values of 
Pi and pj clearly decrease. Their ratio Ra is not constant, but it done not change 
very much after n has got beyond the range of very small wvmplcs-'-aay 5(c +1). 
The value obtained from the chi-square table is the upper limit of R# for a fixed c 
and increasing n. Since Ro is to a first approximation a function of c alone, 
provided n is not very small, it is a useful index for the construction of tables, 
and gives great compactness. 

Using the chi-square approach, we note that 

D, F, = 2c -{■ 2 


^Pi ~ Jxso+J.i-o 

R, = . 

Table I gives Ro , c, and npi over a considerable range, with a «i3 * .06. 
Given Pi and pi, we calculate Ro and use it to enter the table; c is read off directly, 
and the sample size is n » wpi/pi. 

Sample sizes obtained in this way will be too large when the true distribution 
of defectives follows the binomial or hypergeometric laws. There is, however, a 
gain in protection due to the extra inspection. For the binomial case the exact 
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TABLE I 


Single sample inspection plans 
a = = .05 


Rq 


npi 

68. 

0 

.051 

13. 

1 

.355 

7.6 

2 

.818 

5.7 

3 

1.366 

4.6 

4 

1.970 

4.0 

5 

2.61 

3.6 

6 

3.29 

3.3 

7 

3.98 

3.1 

8 

4.70 

2.9 

9 

5.43 

2.7 

10 

6.17 

2.63 

11 

6.92 

2.53 

12 

7.69 

2.44 

13 

8.46 

2.37 

14 

9.25 

2.30 

15 

10.04 

2.24 

16 

10.83 

2.19 

17 

11.63 

2.14 

18 

12.44 

2 10 

19 

13.25 

2.07 

20 

14.07 

2.03 

21 

14.89 

2.00 

22 

15.72 

1.'92 

25 

18.22 

1.81 

30 

22.44 

1.71 

37 

28.46 

1.61 

47 

37.20 

1.61 

63 

51.43 

1,335 

129 

111.83 

1.261 

216 

192.41 


In view of the approximate nature of this table due to the Poisson distribution, 
it is suggested that when the calculated value of Ro does not appear, the table be entered 
vdth the next larger value. This rule will result in partial compensation for the 
approximation. 
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values Pi and pa for a given n and c can be calculated, using a table of the 
5 per cent points of the F (variance ratio) distribution. We may take 

m = 2(n — -c) 

= 2(c + 1) 

Fi = F(ni , ni) 

Fi = F{ih . wi) 


Then 

__ n* 

«2 + niFi 

and 

_ n^Fi 

ui d- 712 F 2 


utDizing a property of the F distribution pointed out in [3], page 2. 
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ON AN EQUATION OF WALD 

By David Blackwell 
Howard University 

Let Xi, X 2 , • ■ ■ be a sequence of independent chance variables with a com¬ 
mon expected value a, and let iSi, (Sj, • • • be a sequence of mutually exclusive 

eo 

events, & depending only on Xi, • ■ • , Xi, such that 2 Pi^k) = L Define 

the chance variables n = n{Xi, Xi ,-■•) = k when Sk occurs and W = Xi-i- 
■ • • -h X„ . We shall consider conditions under which the equation 

(1) EOV) = aE{n), 

due to Wald [3, p, 142], holds. 

This equation has various interpretations: 

A. n may be considered as defining a sequential test on the X<. If 0 and 
E{W) are known, (1) may be used to determine E{n), the expected numlxir of 
observations required by the sequential test, [3, p, 142 et seq]. 

B. n may be considered as representing a gambling system, i.e, it represents 
the point at which a player decides to stop. W then represents his winnings, 



ON AN EQUATION OF WALD 


85 


and ( 1 ), in the special case a = 0 , says that, if each play is a fair game, then the 
system leads to a fair game. 

O. n may be considered as the duration of a random walk. The meaning of 
W and ( 1 ) is obvious. 

More exactly, we shall investigate conditions on Xi under which (1) holds 
for every test n of finite expected value. Our results. Theorems 1 and 2, are 
that ( 1 ) holds if the X, have identical distributions, or if they are uniformly 
bounded Theorem 1 is a generalization of a result of Wald [3, p. 142]. 

The test n may be considered as a test on the variables Yt = X, — a. Then 
W' = + ■ ■ • + y„ = W — wffl, so that EiW) = 0 is equivalent to ( 1 ) for 

tests of finite expected value. Thus it js no loss of generality to assume o = 0 
and to seek conditions under which EOV) = 0. We remark that if E(n) does 
not exist, then jB(pr) need not be zero. For example define Xi = ±1 with 
probability and n as the smallest integer k for which Xi -{-■■■ + Xi, = 1. 
Then E(W) = 1. (It follows from Theorem 1 or 2 that E(n) cannot exist, which 
can also be shown directly.) 

Theorem 1. If Xi, X 2 , • ■ have idenitcal disiributtons, E{X,) = 0, E(n) < 
00 , then E(W) = 0. 

Proof: Define chance variables n* inductively as follows: ni = n. Supposing 
Wi, ,nfc to be defined, define n-i+i = .+„j+i, J„,+, ••■) 

i.e. ni, nj, • • • are the successive values of n obtained by iterating the test. 
Then 


( 2 ) 


Pint, ••• , Uhl Uk+i = j) = PiSi). 


For the event {nt = at, , n* = aid = R depends only on Zi, > 

while under the hypothesis R the event [uk+i = 3] coincides with the event S = 
■•■) = j}. Thus P«(S) = P(fif). Finally P{S) = PiSf) 
since S is defined by imposing the same conditions on Zo,+. .+ 044 . 1 , ■ ■ ■ that Si 
imposes on Xi, • ■ • , X/. ( 2 ) shows inductively that ni, na, • • • are defined 
everywhere and are mutually independent with identical distributions. Now 
define Wk = X„j+...+„ 4 _j +1 + ■ • • + X„i+ +„*. A similar argument shows that 
Wi(= W), W 2 , • ■ ■ are also independent variables with identical distributions. 
The strong law of large numbers [2, p. 488] asserts that, with probability one, 

( 0 ) --)■ 0 as W —>■ . 


It follows that, with probability one, 

Wi + ■ • • + Wt 

«■! -f--+ Wfc 

Wt+ + Wi 


For if 


then 


Til 4" 

Xi 4- • 


• 4~ 

4- Xy 


N 


> e 


> € 


0 as & > 0 . 
for an infinite number of k, 
for an infinite number of N, 
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'Which by (3) is aa event of probability zero. Also from the strong law of large 

numbers E(n) ^ith probability one. Then 

k 

Wi + Wh ^ ( Wi+ + Ft Y m + • • • + wA Q 

k \ Wi + • • ■ + / \ ^ / 

with probability one. It follows from the converse of the strong law of large 
numbers [2, p. 488] that E(JVi) = EiW) = 0. 

Write /Sfi + ■ • ■ + iSi = f/jt, C{Uk) = Vh so that 7* = {n > ^:1. Then (a) 
7* depends only on Xi, • ■', Xt, (b) 7i D 72 , 7(7*) ->• 0, Conversely 

any sequence of sets 7* satisfying (a) and (b) defines a sequential test on Xf ; 

define n = k on 7*-iC'(7*). Moreover Bin) < «> if and only if (c) ^ 7(7*) 

Jk—1 

converges [1, p 297). Now 

E(W) = lim E f (Xi + ■ • • + X*) dP = lim E f + • ‘ * + Xj,) dP 

Jf-*M fc-L Jst t-1 ■'a* 

= lim f (Xi -]-•••+ X*r) dP ~ —lim f (Xi + • • * X*-) dP, 

Juy lf-><c •'V/f 

This establishes the following 

Lemma: If EiXi) = 0, then E(W) = 0 for every test of finite exyeclcd value if 
and only if for every sequence of sets 7y satisfying (a), (b), (c), 



+ Xu) dP 0. 


From this condition we obtain easily 
Theorem 2. If E{Xi) = 0, | X,- [ < M, E(n) < w, then EiW) = 0. 
Proof: If 7iv^ is a sequence of sets satisfying (a), (b), (c), then 


f (Xl+ +X;.)dP 

■>rjf 


< M1VP(7^). 


Now the series Z P(7jv) is a convergent series with decreasing positive terms. 
It is well known that under these conditions EPiV^) 0. It follows from the 
lemma that E{W) = 0 

The question of finding sufficient conditions for EiW) = 0 more general than 
those given in Theorems 1 and 2 is of interest. The bare condition EiXt) = 0 is 
not sufficient, as the following example (which is simply the system of doubling 
the stake) shows: X, ± 2* with probability n is the smallest integer k for which 
X* > 0. A simple computation shows P(n) = EiW) = 2. It is well known 
that the expected amount of capital required for the above system is infinite. 
That this is generally true for such systems is shown by the following theorem, 
in which no hypothesis is made concerning the existence of P(n). 
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Theorem 3. If E{X,) = 0, E{W) > 0, then E{Z) = — », where 
Z = min (Xi + • • * 4" X\^. 

k^n 

Proof: It follows from the proof of the lemma that 

f (Xi+ - + Xy)dP-^-E(W). 

•’ry 

Now on Vy, Z < (Xi + ' • ■ + ^j^)- Hence 

lim / ZdP<-EiW). 

>r-*ee Jvtj 


Thus EiZ) cannot exist if E(W) > 0,sinceP(Fjif) —>0. Since Z < Xi, [ Z dP 
exists; consequently E(2 ) = — m. 
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CORRECTION TO THE PAPER “ON A PROBLEM OF ESTIMATION 
OCCTJRING IN PUBLIC OPINION POLLS” 


By H. B. Mann 
Ohio State University 

In the paper “On a problem of estimation occurring in public opinion polls” 
(Annals of Math. Slat., Vol. 16 (1945), pp. 85-90) the author made the assertion 
that, in the notation of the paper, E[(«,- — is always smaller than .©[(«< — e<)*]. 
This statement is incorrect and its supposed proof contains a numerical error 
in the fourth line from above on p. 90. 

We have 


-^L L L 2k *<*’ <'• 

■ i ^3 Hi 


a-, 


pi)J da: dy dpi 
a;!/)"] dx dy 
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The last integral is tabulated in Karl Pearson’s Tables for Stalislidans and 
Biometndans, Vol. 2, p. 93. Comparing this table -with a table of the normal 
probability integral it may be seen that there exists a value 5 such that 

> Bir]) for c < 5, 

E{e\) < E(rl) for c > 5. 

The quantity c lies in the neighborhood of 2, 

I am indebted to Professor J. W. Tukey for bringing the error to my attention. 



NEWS AND NOTICES 

Readers are invited to submit to the Secretary of the Institute news items of interest 

Personal Items 

The following members of the Institute are teaching in Army University Cen¬ 
ters in Shrivenham, England; Biarritz, France; and Florence, Italy: T. A. 
Bancroft, Alonzo Cohen, E. E. Blanche, P, R. Rider. 

Dean Walter Bartky of the Umversity of Chicago has been appointed as the 
representative of the Institute of Mathematical Statistics to the Division of 
Physical Sciences of the National Research Council. 

Mr. Clyde A. Bridger represented the Institute at the Inauguration of Dr. 
F. S. Hams as President of Utah State Agricultural College on November 16. 

Dr. C. West Churchman has resigned his position at Frankfort Ai'senal and has 
accepted the appointment of Assistant Professor of Philosophy at the University 
of Pennsylvania. 

Assistant Professor D. B. DeLury of the University of Toronto has been ap¬ 
pointed to an associate professorship at Virginia Polytechnic Institute. 

Mr. George Eldredge, formerly with the Aluminum Research Laboratories at 
New Kensington, Pennsylvania is now corrosion chemist with the Shell De¬ 
velopment Company at Emeryville, California. 

Dr Will Feller of Cornell University has been appointed as the representative 
of the Institute of Mathematical Statistics on the Policy Committee of the 
Mathematical Organizations. 

M. Bernard Hecht has joined the International Resistance Company, Phil¬ 
adelphia, as head of the Quality Control Department. 

Lt. Col. Paul Horst has returned to his previous position at Proctor and 
Gamble at Cincinnati. 

Professor Harold Hotelling of Columbia University has been made a part time 
consultant on statistical problems to the Division of Statistical Standards of 
the Bureau of the Budget. 

Dr. S. B. Littauer is now chairman of the Mathematics Department of New¬ 
ark College of Engineering at Newark, N. J. 

Lieutenant Commander A. L. O’Toole has been decorated with a Bronze 
Star Medal for his outstanding service in the South Pacific during the past two 
years. 

Associate Professor H. H. Pixley of Wayne University has been appointed 
Assistant Dean of the College of Liberal Arts. 

Dr. H. B. Mann has been appointed to an associate professorship at Ohio 
State Umversity. 

Miss Dorthy J. Morrow has been appointed to an assistant professorship at 
George Washington University. 

Professor C. J. Rees of the University of Delaware has received a citation for 
his work in a civilian capacity with the 14th Air Force Headquarters. 

89 
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Dr. L, V. Toralballa is a special instructor in the Matlipmatics Department at 
the University of Michigan. 

Associate Professor Abraham Wald of Columbia Univeraity has been promoted 
to a professorship. 

Mr Grover C, Wirick, Jr. is doing graduate work at the University of 
Michigan. 

Henry Goldberg of the Columbia University Statistical Ilpsearch Group died 
April 19,1945. 


During the last quarter of 1945, many members of the Institute engaged in 
statistical quality control were favored by visits from Messrs. W. A. Bennett and 
M Milbourn, the successful candidates in a scholarship competition organized 
by the Quality Control Panel associated with the Midland Region of the British 
Ministry of Production. In addition to the competition, for which witli a three 
months’ trip to the United States as a prize, 92 papers on industrial applications 
of statistical methods were submitted. This Panel has been active in organizing 
regular discussion groups and in arranging courses of lectures at the Birmingham 
Technical College, later published by the Birmingham District Committee as 
a “Symposium of Papers on Quality Control”, copies of which are still available. 

Mr. Bennett is Works Manager of the English Needle and Fi.shing Taeklo 
Co., Ltd,, of Redditoh, and Mr. Milbourn is a physicist who has worked mainly 
in the field of spectrographic analysis and physical metallurgy in the. Research 
Department of Imperial Chemical Industries, Metals Division, Birmingham, 
It is natural, therefore, that Mr. Bennett’s paper dealt with tho management 
problem of organizing a Statistical Quality Control Bureau and defining its 
duties, whereas Mr. Milboum’s paper considered the oiieration of quality control 
techniques as a means for detecting and identifying causes in production research. 

Toward the close of their visit in this country they indicated that the future 
of Quality Control, both here and abroad, will depend on establisliing an adequate 
theory of control that includes statistical along with all other necessary factors. 
This provides a challenge that must be answered by the statistical societies and 
the colleges, as well as by the quality control people. 


New Members 

The following peraone have been elected to membership in the Institute: 

Bal, Kenan Y. (Columbia) Statistical Control, Hq. AFPDC, 830 West Broadway. Ixmiaville 
3, Kentucky 

Coles, James Stacy, Ph D, (Columbia) Research Supervisor, Underwater Exploaivea Ro- 
iTr«nr Institution. Box 63t, Woado Hoh, Mans. 

^ York 

Grelder^C. Edwin, Jr., B.A, (Michigan) Actuarial Clerk, tm Olenumd BM., Scheneclady 

Psychology Dept,, Princeton University, Prince- 
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Harrison, Joseph O., Jr., B.S (George Washington) S805 Kingsbridge Ave., Apt 3F, A^eto 
York, N Y 

Hodges, Joseph Lawson, Jr^, A B (California) Operations Analyst, Army Air Forres, 
18S7 Park Road, N.W , Washington 10, D. C. 

Hoskins, Robert Heywood, A B. (Harvard) Radio Technician, Third Class, U. S Navy 
Teaching Fellow in Mathematics, Harvard University, Separation 3, Separation Center, 
Shoemaker, California 

Lowry, Edward D, Statistician (Western Cartridge Co., E Alton) 8H Bth St., East 
Alton, III. 

Rees, Prof. Carl J,, Ph.D (Pennsylvania) Head of Math Dept., Univ. of Delaware, 
Newark, Delaware 

Seth, Goblnd Ram, M.A (Delhi) Lecturer in Math, Hindu College, Delhi (On Leave) 
1S4A, John Jay Hall, 116th Street, Columbia University, New York 37, N Y. 

Sllber, Jack, B.S. (Chicago) 1)908 N. Springfield Are , Chicago £B, III 

Stone, Goldie F., A M. (New York) 678 Davison Si , Bronx, New York, N. Y. 

Szatrowskl, Zenon, Ph.D. (Northwestern) Instructor in EconoiTiies Department, North¬ 
western University, Evanston, Ill. 

Wadley, Francis Marlon, Ph D. (Minnesota) Statistical Consultant, Bur. of Entomology 
and PI. Duar,, USDA, SSIB N. Albemarle, Arlington, Virginia 

Waugh, Frederick V., Ph D (Columbia) Agricultural Economist (Office of War Mobil, 
and Recon.) 1006-i6 Street, South, Arlington, Virginia 



REPORT ON THE CLEVELAND MEETING OF THE INSTITUTE 


A meeting of the Institute of Mathematical Statistics ivaa held in (’JevcJand, 
Ohio, Thursday to Sunday, January 24-27,194G in conjunction n'ith the Annual 
Meetings of the American Statistical Association and the Econonictrir, .Society, 
The following 115 members of the Institute attended (he meeting: 

Beatrice Aitchieon, Armen A, Alcluan, Franz L. Alt, Rielmrtl L AritlfMtm, Kciinclli J, 
Arnold, Max Aatrachan, George J Auner, Kenan V. Bal, Walter Hartky, William I> liaien, 
Harold R Bollison, Archie Blake, Cheater I. Bliaa, Alliert 11. Howkcr, T. H. Hr.mii, Rtihert 
W Burgess, Oscar K Buroa, Irving W Burr, Burton fl, Camp, (’. Weat Cluirehrimn, Wil¬ 
liam G Cochran, Edward P. Coleman, Francis G Cornell, Jerome t'ornlield, Donald U. (J. 
Cowan, Dudley J Cowden, Gertrude M Co\, John H Curliaa, Joseph F. Daly, (’ulliliprt 
Daniel, Bease B Day, Walter L. Deemer, Jr , Daniel B. DeLury, W, Edwards Demiiig, 
Bernard Dempsey, Paul S Dwyer, Churehitl Eiaenharl, Mary L. Elvebar-k, Hciijarniri 
Epstein, Wilmoth D. Evans, Carl H Fischer, Irving Fisher, T, X. K, tireviJle, 7‘ri'gve 
Hanvelmo, Clausin D Hadley, Margaret J. Ilagood, K. W. llidliert, Morns If, Ilniiaen, 
Boyd Harahbarger, Byron R. Hayden, Harold Hotelling, Earl E. IltniBeman, I-mmid Hiir- 
wicz, William Hurwitz, Calvin J Kirchen, Lila F. Knudaen, Hendrik .S. Kontjn, T;alling 
Koopmans, Morton Kramer, Anita R. Kury, Robert Ludd, Diekaon II, Leaveim, Rov ladp- 
nik, E Vernon Lewis, Eugene Lukacs, Henry B Mann, tieorge F. T. Mayer, Erlwiird 0. 
Molma, Alexander M Mood, Margaret Moore, Joscpli E. Morion, Fredc’rirk M.wteL 
ler, Charles MoC. Mottley, Paul M Neurath, Horace W. Xorton, Edwin (1 Olds. Paul S, 
Olinatead, Guy H Orcutt, James G, Osborne, Russell F. Passano, Paul Peach, Alum K. 
Andrews Priestley, James Rafferty, Sophie Ilakesky, ChnrlcH F, Rmis, A R.wiimlcr' 
Herman Rubin, Phillip J, Rulon, ManonM Sandomirc, Fnuikliii E, Siiltcrtliwailc l-;«thcr 
Schaeffer, Edward M^Schrock, David II, Schwartz. G. R. Seth, Lawrence W. Shaw, Jack 
Sherman, Walter A, Shewharl, Walt R, Simmona, LcbUc, E, Simon, Jolm H. Smith, J. R 
Steen, Joseph Stemherg Henry W. Steinhaua, J. W. .Sullivan, Zenon .Szatrowmkii Bcn^ 
jamin Tepping, John W. Tukey, Helen M. Walker, W, Allen Wallia A K R Weatiimti 

y„LT'‘'' ■■■ K '.1 

The first session of the mcetinj was held jointly with the Amorican Statistical 
Association on Thmsday afternoon on Nimeneal SolutUm oj RiQrmion Bqm- 

under the chairmanship of Dr. W. E, Deming of the liurenn of the Budget 
file following papers were presented; ” 

XJr Guy Orcutt, Massachuaetta Institute of Technology 

^ Jlet/iod/or the Solution of Hcffrmwn Equations. 

Mr D B Duncan, Hoya! Australian Air Force 

3 Error Control in Matrix Calculation. 

A m ^ Aetna Life Inauranco Company 

4 I he Compact Computation of Canonical Correlations 

Professor P S. Dwyer, University of Michigan, 

afternoon setetion, 

\ as held jointly with the Econometric Society and the American Rfafiatinal 
Association on Kelattas/rrnn WonarperimenW ft! 
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Mordecai Ezekiel acted as chairman, of the morning session and Dr. R. L. 
Anderson was chairman of the afternoon session. Of the following four 
papers, the first two, were presented in the morning and the last two in the 
afternoon: 

1 The Econoviist's ProhUm of Slalialical Inference 
Professor J Marschak, Cowles Commission 

2. Prediclton and Structural Estimation, 

Mr. Leonid Hurwicz, Cowles Commission 

3 Ilerative Cotnpulation Methods in Estimating Simultaneous Relations 
Dr T Koopmans, and Mr. Hoy B. Leipnik, Cowles Commission 

4 Multivariate Analysis in Economies 
Professor Gerhard Tintner, Iowa State College 

On Friday afternoon a session on Experimental Designs and their Analysis 
was held jointly with the Biometrics Section of the American Statistical Associa¬ 
tion under the chairmanship of Professor Gertrude Cox of North Carolina State 
College. The following papers were presented; 

1 On the Uses of Orthogonal Functions in the Analysis of Incomplete Latin Squares 
Professor D B DcLury, Virginia Polytechnic Institute 

2 Use of Adjusting Factors in the Analysis of Data with Disproportionate Subclass Num¬ 
bers. 

Professor R. E. Patterson, Texas A. and M. College 

3, Selection of Sample Size for Detecting Treatment Differences 
Professor A M, Mood, Iowa State College 

4 Rectangular Lattices 

Professor Boyd Ilarshburger, Virginia Agricultural Experiment Station 

On Saturday, a two-session symposium was held jointly with the Econometric 
Society and the American Statistical Association on Sampling in the Social 
Sciences. Professor Arnold J. King of Iowa State College acted as chairman for 
the morning session and Professor S. S. Wilks of Princeton University presided 
in the afternoon. The following seven papers were presented, of which the first 
three were presented in the morning and the remainder in the afternoon: 

1. Problems and Methods of a Sample Survey of Business. 

Mr. M. H. Hansen, Bureau of the Census 

2. Problems of Area Sampling in Agriculture. 

Mr J. 11. Goodman, Bureau of the Census, and Mr, E. E, Houseman, Bureau of Agri¬ 
cultural Economies 

3 Problems of Area Sampling in Population. 

Mr, B. J, Topping and Mr. J. S Steinberg, Bureau of the Census 

4 The Problems of Non-Response, 

Mr W. N llurwitz, Bureau of the Census 

5. Systematic Sampling and its Relation to Other Sampling Designs. (Read by Title.) 
Mrs, Lillian H Madow, Washington 

6, Relative Accuracies of Systematic and Stratified Random Sampling for a Specified Class 
of Populations. 

Professor W. G Cochran, Iowa State College 



94 


BEPOHT ON CLEVELAND MEBTINQ 


7 On the Design of a Sagiple of Dealers’ Inventories. 

Dr. W E Deming, Bureau of the Budget and Dr. Willard Hinunona, Offitt of Price 
Adminiatration 

On Sunday, a symposium was held jointly with the AmmVan Statistical 
Association on Acceptance Sampling under the chairmanslup of Professor John 
W. Tukey of Princeton University. The morning session of the symposium 
was devoted to acceptance sampling by attributes and the afternoon session to 
acceptance sampling by variables. The following program was presented at the 
morning session; 


Papers' 

1 Prewar Developments. 

Mr Paul Peach, North Carolina State College 
2, Wartime Developments, 

Professor E G, Olds, Carnegie Institute of Technology 
Prepared Discussion by, 

Mr. H R. Beilinson, Array Ordnance Department 

Mr D H Schwarts, Quartermaster Corps 

Professor Walter Bartky, University of Chicago 

In the afternoon session the following program was presented: 

Papers 

1 Lot Quality Measured by Average or VariabilUy, 

Lt Commander J, H. Curtiss, Bureau of ships 
2, Lot Quality Measured by Proportion Defeclive, 

Mr, W. A Wallis, Columbia University 
Prepared Discussion; 

Mr E M. Sohrook, Array Ordnance 
Professor A, M. Mood, Iowa State College 
Professor K, J, Arnold, University of Wisconsin 
Lt Commander J. F. Daly, Bureau of ships 
Dr. A. E, R, Westman, Ontario Research Foundation 

A business meeting of the Institute was held at 6 p.m. on Saturday afternoon 
at which time reports were made by the President, Secretary-Treasumr, Editor 
and Chairman of the Committee on Development. These reports arc all 
printed in the current issue of the Annals. 

Paul S, Dwver, 
Secretary, 



ANNUAL REPORT OF THE PRESIDENT OF THE INSTITUTE 

(For 1945) 

I. Development op Public Appreciation for Mathematical Statistics 

The aims of the Institute, as stated in the constitution, are to promote the 
interests of mathematical statistics. First and foremost, research must go on. 
The Annals must be published and its position maintained as the world’s leading 
journal in mathematical statistics. Meetings must be held to provide for further 
dissemination and discussion of research. But this is not all. We should fall 
short of our opportunities for promoting the interests of mathematical statistics 
if we were to lose sight of the need for creating an environment in which mathe¬ 
matical statistics and statisticians can thrive and take their proper place for 
rendering the service that they are capable of rendermg in the political, industrial, 
and scientific life of the nation, 

A fair share of the efforts of the officers and committees of the Institute this 
past year has been devoted to the creation of this environment. The Institute 
has assumed leadership in several movements of importance in this direction 
and has lost no opportunity to cooperate with other organizations toward the 
same ends. Momentum has thus been given to important developments which 
are bound to affect the scientific advancement and employment opportunities 
of all people engaged in statistical work of any kind, whether it be mathematical 
research, consulting, teaching, major or minor roles in large-scale statistical 
proj'ects, preparing questionnaires, designing experiments, analyzing results, 
formulating conclusions and recommendations, or taking part in any other way 
in the collection or use of statistical data. Briefly, these developments fall 
under three main headings. 

(i) SeMing standards of ‘professional competence. . The Description of the Pro¬ 
fession of Statistics, put out by the National Roster this year, has gone a long 
way as a first step toward setting standards of professional competence. The 
officers and many members of the Institute assisted the Roster, particularly 
Professor Harold Hotelling and his Committee on the Teaching of Statistics, 
together with Dr. C. I. Bliss representing the American Statistical Association. 
Although the Roster Description is not intended to represent the official attitude 
of the Institute, it does represent cooperative effort toward cultivation of public 
understanding of statistical work. 

(m) Raising the standards of teaching. Standards of teaching go hand in hand 
with standards of professional competence. The Institute can proudly point 
to the accomplishments of its Committee on the Teaching of Statistics, which 
under the chairmanship of Professor Hotelling, has persistently set forth stand¬ 
ards of teaching which are bound to bring about important changes in the ar¬ 
rangement of statistical courses and organization of statistical teaching. An 
inevitable result mil be greater competence m statistical theory, better research, 
and expanding avenues for more effective application of theory. 

95 
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(tii) Promoiing public understanding and apprrcialum far ihr slnlislirian. 
More adequate public appreciation of statistical tlieory can hn brought about in 
several ways. The first two of these are being actively purMU-d l»y the ofRcera 
of the Institute The third constitutes a proposal; and the fourth, an obligation 
incumbent on every member of the Institute. 

First, through joint meetings with other professions such oh sociologists, 
economists, psychologists, engineers, biometricians, etc. The C'levekntl meet¬ 
ing is an example, the St. Louis meeting of the AAAH to he held in March is 
another. These joint sessions give opportunity for other groups to limime 
aware of the impact of mathematical statistics on their own work, and for mathe¬ 
matical statisticians to hear of the statistical problems in other fields. (ipportu- 
nities for such diffusion of knowledge exist in local chajitcrs as ivell as in national 
meetings, and every member of the Institute should he on tlie lookout for oppor¬ 
tunities to explain how problems in administration, management, economics, 
and manufacturing, are going to require modification in the future owing to new 
work in sampling techniques, acceptance procedures, quality' control, and other 
developments of mathematical statistics. 

The federation of statical sooieties (see Part III) will afford better means 
than existed heretofore for an admbeture of math'emalical statistics with fields 
of application, both in national and local meetings. 

Second, through the work of committees whose responsiliilily i.s to advise 
professional groups, and government and private, research agencies, conce-rning 
the use of mathematical statistics, A notable example is llie Joint (‘ommittee 
for the Development of Statistical Applications in Kngim‘ering and Manufactur¬ 
ing, of which Dr. W. A. Shewhart is chairman. The Institute has two repre¬ 
sentatives on it. Much of the lecent advancement of .statistics in industry is 
traceable to the work of this committee. 

Third, through the establishment and publication of colloquium led urea as 
recommended by Dr Shewhart in his report for the preceding j'ear, or of an 
annual Rietz lecture of broad interest aa recommended liy this years' Committee 
on Development (cf. Appendix A, Part V). 

Fourth, information through expository nonmathematical articles and lectures 
delivered by leading mathematical statisticians before gatlieringa of ucmslalisti- 
cal groups of professional and business men. Such activity is of course informal 
and without record, carried on by individuals as opjmrtunity permits and not by 
official announcement from the office of the Institute. 

II. LoNQ-RANGR PUANNINO 

Through the M'ork of several of the Institute’s commitfees, each tackling 
specific areas of enquiry, the Institute is being providctl with long-range policies 

and planning. In particular, the reports of the following committees should be 
cited m this connection: 

The Committee on Development (Appendix A) 

The Committee on the Teaching of Statistics (Appendix B) 
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The Committee on Finance (Appendix C) 

The Committee on Policy in Regard to Local Chapters (Appendix D) 

These committees are obviously alive to the recent rapid expansion of mathe¬ 
matical statistics in industry and goverpment, and to the opportunities that lie 
ahead for developing proper environment foi greater expansion and service of- 
mathematical statistics. 

in. Federation op Statistical Societies 

A movement of extreme importance to all statistical workers is the proposed 
reorganization of the American Statistical Association as the central organization 
for all statistical societies. This movement owes its impetus largely to the 
recommendation made by our Committee on Development a year ago, and to 
the active part that our officers and representatives played in organizing and 
assisting the Inter-Society Committee. This movement is centripetal and 
replaces the centrifugal forces that were splitting statistical organizations. 
Under the new arrangement, statistics will possess a united front on matters of 
common interest, yet each organization will maintain its autonomy. Nothing 
is to be sacrificed in the ivay of standards of membership, meetings, or publica¬ 
tions Economies will be effected through combined office operations. Much 
will be gained through coordinated effort; wide distribution of a journal of 
general methodology and applications; development of public appreciation for 
statistical work through dissemination of reliable information concerning statis¬ 
tical science and its contributions; cooperation with local and international 
statistical groups; promotion and development of professional standards of 
statistical work; and through cooperation with other professional groups in 
fields of application. 

This federation is not yet accomplished; it is still in process of formulation, but 
it is probably safe to say that agreement on general aims has been reached, as 
well as on many items of detail. The proposition will in time be put up to each 
statistical organization for acceptance. 

IV. Growth and Expansion 

During the year the membership increased from 606 to 777, The work of 
the Institute, vitally affecting many thousands of statistical workers through 
its efforts to enhance public confidence and appreciation for theoretical statistics 
as well as to improve the quality of statistical work, extends far beyond the en¬ 
vironment of its nearly 800 members. Concerted drives for membership should 
continue, but should not be expected to take the place of personal invitation 
in the form of explanation, one man to another, of what the Institute stands for. 
The outlook is encouraging. Year by year as the work and influence of the 
Institute receive wider success and recognition, more and more people will be 
found ready and desirous of joining. 
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V, Administrative Affairs 


As with any active organization, there are certain chorw to he done and inter* 
nal affairs to be administered. The chief burden falls on tiu! executive officer, 
our Secretary-Treasurer, Paul S. Dwyer, who is e-Xiiected 

i. To keep the list of members up to date with addrwsca and titles. Purmsh 
information to the Board regarding increases and derr<‘a«*s m member* 
ship, and issue the Directory. 

ii. To send out notices, to keep the membership infonnwi conrerning meet¬ 
ings and other items of interest. 

iii. To send out bills, and keep the books allowing payment of dues and sub¬ 
scriptions, 

iv. To fill orders for back numbers of the Annals. 

V. To estimate the probable demand for copies of the Annals, current and 
past, and to place orders rvith the printer to be able to supply the demand. 

vi. With the Committee on Finance, to kwip the Boanl ptmtrxl cm the ex¬ 
pected expenditures and income for the year ahead. 

vii. To answer correspondence from other organisations and individuals who 
desire information concerning the Institute. 

viii. To keep a record of proceedings of the Board and businesa nipvtings of the 
Institute. 

ix. To work with the various committees of the Institute, keeping them in¬ 
formed and in Ime on policy, constitution, by-laws, uiid other corarait- 
menta. 

X. With the Committee on Programs, to arrange 8ea«i(ms of contributed 
papers, and to find space in hotels or elsewhere for iiuhling merdings and 
housing members. 

XI. To keep the Board informed concerning recommendations and reports of 
committees, and other matters brought to his attention retjuiririg action 
by the Board, 


xii To conduct continuous membership and subscription drives with or with¬ 
out the aid of committees. 

It is obvious that when an organization reaches the size and activity of the 
Institute, these duties are too onerous to carry on without prtiper assistanee. 

ur Secretary-Treasurer should be freed for proper perfonfiiance of important 
functions which only he can render toward the growth and vitalization of the 
Institute Consideration is being given to two possible plans, eitlier of which 
increase in expenditure. One plan is to provide competent and 
sufficient assistance in the office of the Secretary-Treasurer, and the other is to 
rans er some of his duties (e.g., Items i, ii, iii, iv, x, and xii) to the American 

S!!! ^ cooperative arrangement of this kind 

T 1 .C.+ Institute has been discussed informally with Mr. 

pstimlrnr mi A,8.A., who will be able to provide ua with wt 

stmiates a little later. This kind of arrangement would be a first step and servea 
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as a pilot study in. cost-accounting for the ultimate federation of statistical 
societies (Part III). 

The constitution must be revised, and a committee has been formed to under¬ 
take the task. The one we have has served well, with minor revisions, over 
the first ten years in the life of the Institute, but conditions are now different and 
thorough reconsideration is needed. Among other things, it needs to be revised 
to permit federation with other statistical societies. As it stands it is totally 
deficient in specifying responsibilities between local chapters and the parent 
society. It should embody the recommendations of the Committee on Policy 
in Regard to Local Chapters, or modifications of these recommendations. Also, 
there are ambiguities in the prfesent constitution that need to be cleared up, and 
there is no provision for carrying out the business of the Institute by correspond¬ 
ence when a Board meeting or Committee meeting can not be held. 

The Committee on Meetings must not only seek out suitable papers for meet¬ 
ings, cariying out the wishes of the Board in regard to the subject-matter to be 
covered, but must also be concerned with the geographic location of meetings, 
cooperation with other professional societies, and choice of dates. During the 
past few years, in addition, this committee has had to contend with restrictions 
on transportation and hotel 8ps.ce. The Committee on Finance must decide 
what expenditures are wise and allowable; they must make decisions on in¬ 
vestments and surety bonds. They have calculated the price of life-memberships 
for purchase at various ages. Committees on Membership and on Subscriptions 
must be active. The services rendered by these committees deserve the grateful 
thanks of the members of the Institute. 

Undoubtedly the most lasting contribution that is bemg made by the Institute 
to research in mathematical statistics is the publication of the Annals of Mathe¬ 
matical Statistics. Without some first-hand knowledge of the problems that are 
encountered in publishing a professional journal of high standing it is hardly 
possible to be conscious of the depth of the debt owed by the Institute to Dr. 
Samuel S. Wilks, Editor. During the past few years, in addition to the normal 
editor’s problems of maintaining standards of excellence in the articles published, 
there have been additional diSiculties and delays arising from paper and man¬ 
power shortages in printing. 

In closing this section it is a pleasure to record our appreciation of the as¬ 
sistance and advice received at various times during the year from Mr. Lester 
Kellogg, Secretary of the A.S.A.; also from Mr. E. A. Stephens of the Ohio Bell 
Telephone Company in Cleveland in regard to the difficult problems of hotel 
space which arose in connection with the Cleveland meeting in January 1946. 

VI. Election of Fellows 

Acting in consideration of the advice of the Committee on Membership, the 
Board advanced the following members to the grade of Fellow: 

M. S. Bartlett, Cambridge University 
Trygve Haavelmo, The Norwegian Embassy 
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William N, Hurwitz, Bureau of the Clcnsus 
John von Neumann, Institute for Advanc«i Study 

VII. Election op Ofpicebs 

The following officeis were duly nominated and elected for 1946; 
President, William G. Cochran 
Vice Presidents, Will Feller 

Edwin G. Olds 


VIII. Committees and Reports op Committbb« 


Our committees and representatives on joint committees for the year 1946 are 
shown below. The reports of these committees are appended for the information 
of members. It should be borne in mind that committee reports are for con¬ 
sideration of the Board, they do not commit the Board to any specific action one 
way or another. As already intimated, every member of the Institute may take 
pride in the splendid work of these committees. Like the deliberations of the 
Board, most of the deliberations of the committees were necessarily carried out 
by correspondence because no large meetings were held at which the members 
of any committee or the Board could all be brought together. 

During the year we have been asked by Dean L. P. Eisenhart, Chairman of 
the Division of Physical Sciences of the National Research Council, to name a 
representative. The Board duly appointed Dean Walter Bartky. Tlie invita¬ 
tion from Dean Eisenharb to be so represented is a distinct honor and a rccopi- 
tion of the importance of the Institute in pure and applied research. 

We have also been invited to name a representative to the Policy Committee 
for Mathematicians, to which the Board has named Professor Will Feller. On 
the committee are four representatives from the American Mathematical So¬ 
ciety, one from the Society for Symbolic Logic, and one from the Institute of 
Mathematical Statistics. The Mathematical Association of America haa been 
invited to name two representatives. The constitution and purp(»e.8 of this 
committee are explained in the following paragraphs which are taken from a 
statement that was approved by the A.M.S. Council on November 23, 1946: 


Representatives of each organuation shall be selected in accordance with ^ plan approved 
by the governing body of that organization. 

Mathematical Society eball be a non-voting, ex officio 
member of the committee and shall act as secretary for thevommittee. 

study those problems alToctlng the mathematical profoesioa 
to enll for+1! r”! ‘’'’“»tituent organizations. It shall be empowered 

mCin Lh urganuations on matters which concern the position of mathe- 

conoerninir the effeef,-! or enacted legislation concerned with science, problems 

other questions which t niatheinaticians orpolenlial members of our professiori, and 

among related sSnees wJ?!® n® 

alrfarSe commitments 

in ernatipnal basis by any of the constituent organizations 
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(i e., among these is the International Congress of Mathematicians for which an invitation 
was issued by the American Mathematical Society in 1936). 

This Policy Committee shall be appointed for a period of five years. At the end of that 
time the work of the committee shall be reviewed and a decision made concerning the con¬ 
tinuation of the committee 

A supplemental motion passed by the A.M.S. Council asks the Policy Com¬ 
mittee to concern itself primarily with the profession of mathematics and only 
secondarily with the teaching of mathematics. 

W. Edwards Deming, 

President, 1945 . 



Committees of the Institute 


Gommilie& 

Development 

PertonMl 

William G. Cochran, Chairman 
Paul S. Olmstcad, 

Acting Chairman 
Chester I, Bliss 

Henry Scheff^ 

C. C, Craig 

Frederick Hosteller 

App^ndise 

A 

The Teaching of Statistics 

Harold Hotelhng, Chairman 
Walter Bartky 

Milton Friedman 

W. Edwards Deming 

B 

Finance 

Paul S. Dwyer, Chairman 
Charles F. Rods 

Carl Fischer 

A, C, Olflhen 

G 

Policy in Eegard to Local 
Chapters 

Morris H. Hansen, Chairman 
Gertrude Cox 

Samuel S. Wilks 

D 

Meetings 

John H. Curtiss, Chairman 

T. Koopraana 

William G. Madow 

E 

Membership 

Joseph L. Doob, Chairman 

Paul S. Dwyer 

T. Koopmans 

Will FeUer 

F 

Increasing Subscriptions to 
Libraries and Laboratories 

W. D. Baten, Chairman 

Harold F. D(^ge 

Irving W. Burr 

L. Aroian 

G 

Tabulation 

Paul S. Dwyer, Chairman 

Will Feller 

Churchill Eisenhart 
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Nominations 


C. C. Craig, Chairman 
Frederick F. Stephan 
Gertrude Cox 


Revising the Constitution and 
By-Laws 


Morris H. Hansen 
Allen T. Craig 
Chester I. Bliss 
John Curtisfe 


Representatives to the Inter- John H Curtiss 
Society Committee on Federa- Paul S. Olmstead 
tion 


Representative to the Division Walter Bartky 
of Physical Sciences, National 
Research Council 


Representative to the Policy Will Feller 
Committee for Mathema¬ 
ticians 


Representative to Explain the W. Edwards Deming 
Need of Mathematical Statis¬ 
tics in Research for Defense 

Representatives to the Joint Samuel S. Wilks 
Committee for the Develop- Paul R. Rider 
ment of Statistical Applica¬ 
tions in Engineering and 
Manufacturing 


Appendix A 

Report from the Committee on Development 
I, Geneba-L 

Continuing the work of the 1944 Committee on Post-War Development, this 
Committee has analyzed the purpose and policy of the Institute to see what 
additional activities the Institute should undertake in order to provide further 
stimulus to the development of the field of mathematical statistics. The fol¬ 
lowing existing and proposed activities were considered: 

1. Maintenance of professional standards 
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2. PuUications program 

3. Meetings program 

4. Rietz Lecture 

5. Chapter policy 

6. Cooperation in determining educational standards 

7. Maintaining relationships with other technical socieUca 

8. Increasing membership of the Institute 

In genera], each of these activities is placed in the hands of a committee. Kxcept 
m a few instances, reports of these committees have not been published in the 
Annals. This- committee recommends that each of the committees of the Insti¬ 
tute together with the representatives of the Institute on joint committees be 
requested by the Board of Directors to submit a yearly report for pofisible publica¬ 
tion in the March issue of the Annals so that the members of the Institute may 
be kept informed concerning the Institute’s affairs. 

II. Phofessional Standards 

This committee believes that the Report of the Memberahip Committee 
published in the March 1945 issue of the Annals is typical of the kind of report 
desired, providing, as it does, an outline of present standards for membership 
in the Institute, 


III. PUBUCATIONB 

The publication program has been discussed with the liklitor anti we find that 
we are in agreement with the present editorial policy. Wo recommend that the 
Editor submit a yearly report. 

Although an increased membership among those engaged primarily in the 
application of statistics is desirable, it is not considered advisable to alter radically 
the character of the Annals in order to attract such memberahip. However, 
writers on theoretical topics in the Annals should be encouraged to include illus¬ 
trations of applications whenever feasible. A desirable goal at wdiich to aim 
would be for every issue of the Annals to contain an expository paper reviewing 
progress in a broad field of theory or devoted to new' fields of existing theory 
(these functions are not mutually exclusive). It seems more difficult to obtain 
good papers of this kind than research papers. Now that statisticians are leav¬ 
ing war work the prospect for obtaining such papers should improve. Tlie 
committee has been informed that the Editor has invited certain writers to 
contribute expository papers on assigned topics and it is recommended that this 
poicy be continued^ It is believed that the members of the Institute w'ould 

1 e to be informed in the Editor's report concerning progress in receiving such 
papers. 

Last year this committee considered the possibility that the Institute sponsor 
tne publication of a series of books and monogi'apha. In view of recent develop- 
men s in t e commercial publishing field it seems that there is ample opportunity 
or e pu leation of such works as the Institute might otherwise undertake to 



BEPOBT OP THE COMMITTEES 


105 


publish, and the committee therefore recommends against such Institute action 
at this time. 


rv. Meetings 

Under normal conditions of tiansportation, the Institute has held at least two 
meetings each year, one with the mathematical societies in the summer and one 
with the social science societies in the winter. This committee favors the con¬ 
tinuation of this system Occasionally, meetings have been held with an en¬ 
gineering society. This program does not provide specifically for joint meetings 
with societies devoted to (a) standardization, (b) engineering, or (c) natural 
sciences Arrangements for meetings under (a) and (b) could be made through 
our representatives on the Joint Committee for the Development of Statistical 
Applications in Engineering and Manufacturing, which has representation from 
each of these groups. This committee recommends that the Program Committee 
have on its membership one of the Institute’s representatives on the Joint Com¬ 
mittee and one who is active in the natural sciences. Important duties of these 
members are to give advice on the t 3 rpe of program desiied for joint meetings 
in these applied fields and to make arrangements for the meetings. It is also 
recommended that the Program Committee include Institute members who are 
active in the mathematical societies and in the social science societies so that our 
participation in meetings with these groups will be integral to their piogiam,s. 
Other members of the Program Committee may be chosen with similar aims in 
mind, The yearly report of the Program Committee should discuss among 
other matters the progress made in arranging joint meetings. 

V Rietz Lectxthe 

To direct attention to the work of the Institute, it is recommended that the 
Institute sponsor an annual lecture of broad interest, to be named after its first 
president, the late Professor Henry L Rietz. It is suggested that the lecturer 
be appointed by the Board of Directors, that he be given a year’s notice, and that 
the lecture be arranged for a meeting with an appropriate society, 

VI. Chapters 

In establishing chapters, the Institute has undertaken obligations that to date 
have not been fulfilled. Two courses are open. Either the Institute should 
abolish its existing chapters or it should formulate a policy that will provide for a 
vigorous chapter program. Some requirements for chapters have been set down 
by the Committee on Policy with Regard to Local Chapters (Appendix D). 
It is proposed that this be submitted to the secretaries of our chapters for their 
comments. Further, certain broader aspects of the problem require additional 
consideration. Discussion with various members of the Institute indicates 
that some believe that the interests of the Institute because of its relatively 
small membership might be better served by organizing geographical sections 
rather than chapters. Pending final agreement on these points, this committee 
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recommends that the Board of Directors hold in al>eyaiice any requtwts for the 
formation of new chapters. 

VIL Edticationae Standarps 

The matter of educational standards for college coursoK w now in the hands 
of the Committee on the Teaching of Statistics. Such a committee should be a 
permanent committee of the Institute. 

It is our further recommendation that one member of this committeo be one 
of the representatives of the Institute on the Joint Committee for the Develop¬ 
ment of Statistical AppUcations in En^eering and Manufacturing. It Aould 
be his duty to assess needs for statistics couraes, particularly in relation to stand¬ 
ardization and engineering. 

VIII. Relationships with Other Technioae Societies 

In 1929, the Joint Committee for the Development of Statistical Applications 
in Engineering and Manufacturing was formed. Tlie Institute haa had two 
representatives since 1937. The other sponsor Bocietiea for the Joint C-ora- 
mittee are: 

American Society of Mechanical Engineers 
Amencan Society for Testing Materials 
American Statistical Association 
American Mathematical Society 
American Institute of Electrical Engineers 

Much of the use of statistical method in the war effort is traceable directly to the 
activity of this committee. In particular, this committee is working continu¬ 
ously to see that statistical methods and statistical concepts are introduced in 
connection with work on standardization, engineering, and the natural and social 
sciences. In a report published in the December 1940 issue of the AnnaU, the 
Institute’s War Preparedness Committee made the following recommendations: 

The Institute should “cooperate to the fullest in matters pertaining to quality control 
and specification with the ‘Joint Committee for the Development of Statistical Applica¬ 
tions in Engineering and Manufacturing,’ of which the Institute is a sponsor.” 

Sbe specific steps for a cooperative program with the Joint Committee were 
outlined. However, although this report was accepted by the Board, no action 
was taken on these recommendations. In view of the above, we make the 
following recommendations to the Board: 

1. That the Institute’s representatives be requested to make a report on the aotivltiea 
Committee. (This should be the first of a series of yearly reports.) 

2 That the Board request a report from the Joint Committee on the status of atatiatica 
and statisticians in engineering and manufacturing including forocaets of future needs 
and opportunities. 

3. That the Board request a report from the Joint Committee on the status of statistics 
in the training of engineers including recommendations for such training in the future. 

4. That at least one of the Institute’s representatives be from the engineering or manu¬ 
facturing field. 
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IX. Growth of the Institute 

The Committee on Development has examined the record of gi'owth of the 
Institute and finds that the largest increase in recent years has been among 
people from industry, a group that is still leas than a quarter of the total mem¬ 
bership. It is believed that the program outlined above will stimulate growth 
in membership among all users and potential users of mathematical statistics. 

X. Publicizing Mathematical Statistics 

This Committee recommends that the Institute make available to appropriate 
channels of public information reliable communications concerning mathematical 
statistics. As a specific recommendation, the case for the science of statistics 
should be presented at the hearings of the National Research (Science) Founda¬ 
tion Acts pending in Congress, preferably by representatives acting jointly for 
the Institute and the American Statistical Association. 

XI. The Inteksociety Committee 

A second meeting of the Intersociety Committee mentioned in last year’s 
report is to be held on December 8th. This Committee feels that consideration 
of proposals for reorganization of the Institute should not be undertaken prior 
to advice concerning the action of that Committee. 

W. G. Cochran, Chairman P. S. Olmstead, Acting Chairman 
C. I, Bliss C. C Craig 

F. C. Mosteller H. ScHEPPii 

November 5, 1945 


Appendix B 

Report from the Committee on the Teaching of Statistics 

A preliminary draft of recommendations in the teaching of statistics was read 
by the chairman of this committee at the,Rutgers Meeting at the Institute in 
September 1945. These recommendations are at present being re-drafted by 
members of the Committee and it is hoped that they would be ready to present 
to the Board in the near future for possible publication in the Annals. 

Assistance was rendered during the first part of the year to the National Roster 
of Scientific and Specialized Personnel, toward the development of a formal 
description of the profession of statistics (mentioned in Part I of the Annual 
Report of the President). This assistance was carried out jointly with Dr, 
Chester I, Bliss who was appointed by the American Statistics Association to 
assist with this project. It is believed by this Committee that the description 
put forth by the Roster -will help bring about recognition of standards of pro¬ 
fessional competence in statistics and in the teaching of statistics. 

Harold Hotelling, Chairman 
Walter Bartky 
Milton Friedman 
W. Edwards Deming 
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Appendix C 


Report from the Committee on Finance 


'the Committee on Finance met in the office of Dr. C. F. Roos in New York 
City on September 14, 1945. Present were Messrs. Roos, C. H. Fischer, P. S. 
Dwyer; absent, A. C. Olshen. 

The Treasurer presented a summary of income and expense.s during the third 
quarter of 1945 through September 13. This information was considered along 
with the first half year reports which were prepared some months ago. The 
Treasurer also presented a graph showing balance on hand at the end of each 
month (1939-1945) and one showing income during each month (1939-1945). 
These facts, as well as other pertinent information, were used in formulating the 
recommendations which follow. 

The’Finance Committee proposes to the Board of Directors that the following 
recommendations be approved by the Board as policy for the Institute of Mathe¬ 
matical Statistics. 

1. That no revision be made with reference to the adoption of the expected 
budget for 1945. It appears now that the income will be somewhat higher than 
the amount indicated on the expected budget ($6450) and that the amount of 
expense should he somewhat lower the amount there estimated ($0050), 

2. That the Secretary-Treasurer be instructed to prepare an Annual statement 
for 1945 on the general plan of previous annual statements with the addition of 
an analysis of assets and liabilities. The main assets are cash, bonds, and back 
issues of the Annals It is recommended that the back issues be valued at 75 
cents per copy (for inventoiy purpose)—a fair estimate of cost. It is further 
recommended that no value be placed on exchanges and office equipment. 

3. That the Secretary-Treasurer prepare the annual statement prior to the 
winter meeting, which means presumably that the books will be closed about 
December 10th. 


4. That, in consideration of the nature of the graph of the income of the 
Institute, the Institute adopt the policy of having its yearly report run from 
July 1 to July 1 and that the Secretary-Treasurer be instructed to draw up an 
additional annual report as of June 30, 1946. 

5. That the Secretary-Treasurer be instructed to draw up a budget for 194G 
and to submit it to the Finance Committee in sufficient time so that action may 
be taken on it by the Board at its winter meeting. 

6 That the U. S Government G Bonds now owned by the Institute ($3000) 
be listed on the books at their face values even though the market values of those 
bonds are slightly lower, 

7 -That the total amounts of all life membership payments be placed in a 

i y ^ The market value of these bonds 

8 ThTfr f tbe amount of this fund at any accounting period. 

. at the Secretary-Treasurer be authorized to take whatever steps are 



HEPOBT OF THE COMMITTEES 


109 


necessary to obtain adequate interest on our liquid assets. That he maintain 
sufficient cash position to carry on the business transactions of the Institute and 
that he invest the remainder (a) either in U. S. Government G bonds or (b) in 
short term bonds. 

9. That the purchase from Professor Carver of all back issues jointly owned by 
Professor Carver and the Institute be made an item of the budget for 1946. 

10. That the Secretary-Treasurer be instructed to purchase a $2,000 fidelity 
Bond Form B (a form which covers negligence as well as dishonesty) for 3 years 
for the office of Secretary-Treasurer. 

11. That a policy be adopted of allowing a straight 10% discount to all agencies 
and booksellers who send us subscriptions or orders for back issues. 

12 That the Institute set up a permanent Committee on Finance with the 
Secretary-Treasurer as ex-officio member and chairman. There shall be three 
additional members with terms of three years with a new member each year, 
At the formation of the Committee one member shall be appointed for one year, 
one for two years, and one for three years. A resignation from the Committee 
shall be followed by an appointment for the unexpired term. 

13. That the Board notify any committee working on revision of the Constitu¬ 
tion and By-Laws that it is supporting a permanent committee on Fmance and 
believes it appropriate that a statement of the organization and duties of this 
committee should appear in the By-Laws. 

Paul S. Dwyeb, Chairman 
Cabl H. Fischbk 
Abbaham C. Olshen 
Chables F. Roob 

September 15, 1945 


Appendix D 

Report from the Committee on Policy with Regard to Local Chapters 

Attached to this report is a summary of provisions for organizing and working 
with local chapters; it might be cast into appropriate form and incorporated into 
the Constitution of the Institute From these recommended provisions it will 
be clear that this committee does not favor the organization of weak inactive 
chapters. Unless the membership of the Institute gi'ows substantially it will 
be possible to have only a very limited number of local chapters under these 
provisions 

It is the opinion of the Committee that it is desirable for members of the In¬ 
stitute to amalgamate with members of other statistical organizations in the same 
area to form local statistical societies. We believe this will build stronger local 
statistical organizations and will effect greater advances in the application and 
development of effective statistical methods. Such amalgamation m the formu¬ 
lation of local societies can best be stimulated, and national leadership provided. 
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after the national statistical organizations have accomplished a federation or 
amalgamation. We therefore urge the Institute to use its influence in stimulat¬ 
ing discussion and action concerning national federation or amalgamation. 

The following further comments are made in addition to or Bupplementing 
those provisions recommended for incorporation into the CVinstitution of the 
Institute; 

1, Do not accept or reject the petition from any group until a jilan of organiza¬ 
tion is formulated. There should be clearance on the following questions; 
a. What are the reciprocal responsibilities of chapters and the parent 
organization? Wliat type of chapter activity should the Institute 
seek to promote? What kind of things can chapters do that will 
advance the purposes for which the Institute exists? 

We have indicated m the recommended provisions that the Presi¬ 
dent of the Institute should personally undertake or designate someone 
to work with the chapters in answering these and similar questions, 
b If local chapters are not active will they hinder the efforts of the parent 
organization? We believe that the existence of an inactive organiza¬ 
tion is a detriment to development of an active statistical group in a 
community. Activity can he measured in various ways: 

a. Meetings for research in mathematical statistics 

b. Joint meetings mth other professions 

0 . Bringing in new members to the parent organization 

d. Annual election of officers 

1. If members of a chapter must be members of the parent organization, the 
Secretary-Treasurer of the Institute should notify the secretary of a local 
chapter whenever a new member joins within his area. 

3. It is recommended that if a local chapter desires it, bills for Institute dues 
contain provision for collection of local dues. 

4. The Institute should not allow any local group to use its name unless the 
group contributes to the accomplishment of the aims of the Institute. 

Morris H. Hansen, Chairman 
Gertrude Cox 
Samuel S. Wilks 


Suggested Article on Local Chapters for addition to the Constitution 

1. Local chapters of the Institute of Mathematical Statistics may be organ- 

who arfi^rpTd Institute by a local organization of members 

who are resident within a given limited territory, 

2. The members of the local chapter shall be members of the Institute. 

• chapter may be established upon acceptance by the Board of Direc- 
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5. The affairs of local chapters shall be in general charge of the President of the 
Institute or a representative assigned by him to be responsible for local chapters, 
under the Direction of the Board of Directors. 

6. Any local chapter will be dissolved by: 

(a) failing for two successive years to maintain a paid membership of at 
least 25 members or to hold at least one meeting per year which shall include 
election of officers; or 

(b) by vote of the Board of Directors of the Institute 

7. Each local chapter shall transmit a report to the Secretary-Treasurer of the 
Institute within 30 days of the annual business meeting, reporting among other 
things, on its officers, the number of members, and on the meetings held during 
the year. 


Appendix E 

Report from the Committee on Meetings 

A meeting was held at Rutgers University on Sunday Sept. 16, which was 
attended by 115 members of the Institute. Simultaneously a meeting was held 
by the American Mathematical Society. The first session, which commenced 
at 10 a.m was a symposium on sequential analysis. The chairman was Professor 
W. Allen Wallis of Stanford University and Director of the Statistical Research 
Group at Columbia University. The speakers and their titles are listed below. 

1 Theory of sequential analysis. 

ProfeBBor A. Wald, Columbia University 

2. Construction of multiple sampling inspection plans for attributes from sequential prin¬ 
ciples 

Dr. Milton Friedman, National Bureau of Economic Research and the Statistical 
Research Group 

3 Applications of sequential analysis to the ranking of two populations with respect to a 
single parameter. 

Mr. Meyer A. Girshick, Bureau of Agricultural Economics and the Statistical Re¬ 
search Group 

The afternoon session was a series of contributed papers, followed by a pre¬ 
liminary report from the Institute’s Committee on the Teaching of Statistics, 
which was delivered by Professor Harold Hotelling. Dr. W. Edwards Deming, 
President of the Institute, was chairman of this meeting. The list of contributed 
papers follows hereunder. 

1. On the variance of a random set in n dimensions. 

Dr. Herbert E. Robbins, The Post Graduate School, Annapolis 

2. The non-central Wisharl distribution and its application to problems in multivariate 
analysis. 

Dr T, W Anderson, Jr., Princeton University 

3. The ejfect on a distribution function of small changes in the population function. 
Professor Burton H. Camp, Wesleyan University 

4. On composite distributions. 

Dr. Casper Goffman and Dr Benjamin Epstein, Westinghouse Electric and Manu¬ 
facturing Company 
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6 Population, expected values, and sample 

Professor Emil J. Gumbel, New School for Social Rosearch 
6. On the selection of a sample in repealed steps 
Dr. William G Madow, Bureau of the Cenaua 

7 On optimum estimates for stratified samples (Presented by Margaret Gurney, Bureau 
of the Census) 

Mr. Morris H. Hansen and Mr, William N. Ilurwitz, Bureau of the Oenaua 
8. Pearsonian correlation coefficients associated with least squares theory. (PreaetUcil by 
title) 

Professor Paul S. Dwyer, University of Michigan 

At this writing preparations are being made for a meeting to lie held in Cleve¬ 
land, January 24-27, 1946, and for a meeting with the A.A.A.S. to be held in 
St. Louis, March 27-30. 

John H. Curtiss, Chairman 
T. Koopmans 
W iLLUM G. Madow 


Appendix F 

Report from the Committee on Membership 

The Committee, after study and consideration, recommended to the Board of 
Directors that Messrs. M. S. Bartlett, T. Haavelmo, William N. Hunvitz, and 
John von Neumann be advanced to the grade of Fellow. This recommendation 
was approved by the Board. 

The Committee, with the advice and approval of the Board is preparing a 
letter to be sent to groups of people who are not members of the Institute to call 
their attention to the work of the Institute. This letter will be accomiianied by 
reprints of a recent paper by Wald and Wolfowitz on Sampling inspection-plans 
for continuous production, with a brief explanation of the field coveted by the 
Wald-Wolfowitz paper, and the statement that it and others that have appeared 
in recent issues of the Annals have already modified statistical practice in im¬ 
portant ways. 

Joseph L. Doob, Chairman 
Paul S. Dwtter 
T. Koopmans 
Will Feller 


Appendix G 

Report from the Committee for Increasing Subscriptions to Libraries and 

Laboratories 

This committee prepared suitable literature to send to prospective subscribers. 
This literature contained a concise description of the nature of the Annals, a 
table of contents for a year, and a subscription blank. 
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Alphabetical lists of public, college, university and industrial libraries were 
prepared. These lists contained the name, the librarian, and the address of each 
library They were checked for duplicates for present subscribers and sent to 
Professor Dwyer, Secretary-Treasurer. Altogether, the list contained about 
1500 libraries. 

Professor Dwyer took care of printing the literature, further checking for 
duplicates, addressing the envelopes, and mailing. 

WiELUM DowBLt Baten", Chairman 
Harold F. Dodge 
Irving W. Burr 
L. Aroian 



ANNUAL REPORT OF THE SECRETARY-TREASURER OF THE 

INSTITUTE 


(For 1946) 

Accounts of the Butgera meeting of the Inatitute appeared in the September 
issue of the Anmk. Notices of meetings of the Washington Ciiaptcr have been 
sent out from the office of the Secretary-Treasurer. 

Due to a large extent to activity of the members, the Institute lias enjoyed a 
large increase in memberhip during the year. The 606 members of a year ago 
have increased to 777. This is an increase of over 28%. 

The Secretary-Treasurer wishes to acknowledge the continued assistance of 
Professor Lloyd Knowler in looking after the back issues of the Anmk which 
are stored at Iowa City. 

The following financial statement is drawn up along lines specified by the 
Finance Committee and the Board of Directors. It covers the period December 
31,1944 to December 31,1946. 


FINANCIAL STATEMENT 
December 31,1044, to December 31, 1945 
A. Reckiptb 

Balance on Hand, December 13, 1944. <6,700.65 

Dubs . 4,108,40 

Life Membership Payments . 885.00 

SCBSCBIPTIONB . 1,515,73 

Sale OF Back Numbers . 1 787.40 

Income from Investments . 75.00 

Miscellaneous . oo 


Total 


Annals—Current 
Office of Editor. 
Waverly Press, 


B, Expenditures 


<16,112^44 


$400.00 

4,056.42 


Annals—^Back Numbers 

Purchase from H. C. Carver. 

Reprinted 300 copies . 

Vol, I No 2, Vol. II No. 1, Vol. IX No. 1. 
Iowa City Office. 


$4,450.42 

280.61 

727.60 

46.00 


Office op President. 

Mathematical Reviews. 

Office op the Secretary-Treasurer 

Printing, Mimeographing, programs, etc. 
envelopes). 


1,063.01 

. 130,25 

. 100.00 

(including stamped 
. 754.68 
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Postage and supplies . ,, 166 25 

Clerical help. , 852 95 

1,773.78 

M 1 SCE 1 XANKOU 8 . . ., SO 76 

Balance on Hand, Decembeh 31, 1945 (Cash and Bonds). 7,648.22 


$16,112.44 

C. SiiMMAHY OP Receipts and Expenotthbes 

Balance on Hand,* Decembeh 31, 1944 .$6,790.66 

Receipts duhing 1945 . . 8,321 79 

Expenditures during 1946 . 7,664.22 

Balance on Hand,* Decembers!, 1946 . 7,648.22 

Net Excess op Receipts over Expenditures, 1945 . . . 757 67 

D Comparison of Assets on December 31, 1944 and December 31,1945 

m mu 


US Government G Bonds . . . 

$3,000.00 

$6,000 00 

Life Membership Funds. 

330 00 Bank 

f 888.00 F Bonds 
(327.00 Bank Dep. 

Additional Bank Deposits ... . 

3,460.66 

333 22 

Current Accounts Receivable . 

303.73 

265.35 

Estimated Value (Cost)** 



Of back issues of Annaia 



At Iowa City. 

4,210.26 

3,826 75 

At Ann Arbor.. . 

567.00 

1,242.80 

Deduct Estimated Value of issues owned by H. C. 



Carver. 

879 60 

670.60 

Total. 

$11,001.03 

12,301.62 

Neb Gain 1945 .... 

I .. .. 

1,300.49 


E, Liabilities op Institute op Mathematical Statistics as op December 31,1946 

All bills which have been presented have been paid and there are no outstanding ac- 
oounts against the Institute of appreciable size. The $1215 in Life Membership payments 
require the Institute to provide the privileges of membership for life for the 17 members 
who have made payments. About $2600 should be credited to 1946 dues and subscriptions 

PAUL S. DWYER 

‘ Secretary-Treasurer. 

December 31,1946 

* In form of bank deposit and government bonds. 

** Value of Annah calculated at 76 cents per copy. All 1944 figures and 1946 Ann Arbor 
figures baaed on physical inventory. 1946 Iowa City figures based on book inventory. 









ANIfUAL REPORT OF THE EDITOR 
(For 1945) 

In Bpite of the war, enough papers in mathematical statiatics have been 
proposed for publication in the Annak in 1946 to keep the total volume of ma¬ 
terial at approximately 450 pages, the level which has t>ecn maintained during 
the last few years. A total of 40 papers were published in the IWS volume of the 
Annals of which 14 were short notes published in the “notes'* wet ion. The 
outlook for a sufficient number of acceptable paimra to maintain the usual volume 
of publication during 1946 looks quite favorable. Many mathmnatieal statis¬ 
ticians who were engaged in war work are now free to resume their it'scarch. 
In some cases statistical theory developed in connection with classified war 
research projects can be expected to be declassified in the near future and made 
available for open publication. 

Most of the material whioh has been published in the Annals consists of original 
research or extensions of work already publislied in mathematical statistics as 
contrasted with material of an expository character. In vieiv of the considerable 
number of newcomers into the Institute, as well ns a general increase of interest 
in probability and statistics during recent years, it would 1 k! highly desirable to 
publish more expository or survey material. Invitations have l)Pen accepted by 
several individuals to prepare expository articles, but they have. Iwen so heavily 
burdened with extra work during the war that they have bcKjn unable to complete 
their tasks, It is hoped that circumstances will now permit the preparation of 
expository articles. 

On behalf of the Editorial Committee for the Annak, the Editor takes this 
opportunity to acknowledge with thanks the refereeing awistance which has 
been received from the following individuals during 1945; R, L. Anderson, T. W. 
Anderson, George W, Browm, A. H. Copeland, W, J. Dixon, J. L. Doob, Milton 
Friedman, M. A. Girshick, M. Kac, T. Koopmans, Carl Kossack, D, H, Ix?limer, 
H. B. Mann, P. J. McCarthy, P. C. Hosteller, H. E. Robbins, J. W. Tukey, 
W. A. Wallis, J. D. Williams, and C. P. Winsor. The Editor is also indebted to 
the following individuals at Princeton University for preparation of manu¬ 
scripts for the printer, and other editorial assistance from time to time in con¬ 
nection with the Annals-. Mrs. Gladys B. Huling, Luis F. Nanni, Mrs. Euthie 
Ross, Mrs. Eleanor C, Schoenly, and John E. Walsh. 


December SI, 1945 


B. B. WiPKS 

Editor 
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CONSTITUTION 

OF THE 

INSTITUTE OF MATHEMATICAL STATISTICS 

ARTICLE I 
Name and Rubposb 

1. This orgftiiizatioD shall be known as the Institute of Mathematical Statistics. 

2. Its object shall be to promote the interests of mathematical statistics. 

ARTICLE II 

Membership 

1. The membership of the Institute shall consist of Members, Fellows, Honorary 
Members, and Sustaining Members. 

2. Voting members of the Institute shall be (a) the Fellows, and (b) all others. Junior 
members excepted, who have been members for twenty-three months prior to the date 
of voting. 

3. No person sliall be a Junior Member of the Institute for more than a limited term as 
determined by the Committee on Membership and approved by the Board of Directors. 

ARTICLE III 

Officers, Board of Directors, and Committee on Membership 

1. The Officers of the Institute shall be a President, two Vice-Presidents, and a Secret 
tary-Treasurer. The terms of office of the President and Vice-Presidents shall be one 
year and that of the Secretary-Treasurer three years. Elections shall be by majority 
ballots at Annual Meetings of the Institute. Voting may be in person or by mail. 

(a) Exception. The first group of Officers shall be elected by a majority vote of the 
individuals present at the organization meeting, and shall serve until December 31,1936. 

2. The Board of Directors of the Institute shall consist of the Officers, the two previous 
Presidenta, and the Editor of the Official Journal of the Institute. 

3 The Institute shall have a Committee on Membership composed of a Chairman and 
thrio Fellows. At their first meeting subsequent to the adoption of this Constitution, the 
Board of Directors shall elect tluee members as Fellows to serve as the Committee on 
Membership, one member of the Committee for a term of one year, another for a term of 
two years, and another for a terra of three years. Thereafter the Board of Directors shall 
elect from among the Fellows one member annually at their first meeting after thdr elec¬ 
tion for a term of three years. The president shall designate one of the Vice-Presidents as 
Chainnan of this Committee, 


ARTICLE IV 
Meetings 

1. A meeting for the presentation and discussion of papers, for the election of Officers, 
and for the transaction of other business of the Institute shall be held annually at such 
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time as the Board of Directors may designate. Additional moetinga may be called from 
time to time by the Board of Directors and shall be caUed at any time by tlic President 
upon written request from ten Fellowa. Notice of the time and place of meeting shall be 
given to the membership by the Secretary-Treasurer at least thirty days prior to the 
date set for the meeting. All meetings except executive sessions shall l>e open to the 
pubhc. Only papers accepted by a Program Committee apiminted by the President may 
be presented to the Institute. 

2. The Board of Directors shall hold a meeting immediately after their election and 
again immediately before the expiration of their terra. Other meetings of the Board 
may be held from time to time at the call of the President or any two members of the 
Board. Notice of each meeting of the Board, other than the two regular meetings 
together with a statement of the business to he brought before the meeting, must be 
given to the members of the Board by the Secretary-Treasurer at least five days prior to 
the date set therefor. Should other business be passed upon, any member of the Board 
shall have the right to reopen the question at the next meeting. 

3. Meetmgs of the Committee on Membership may be field from time to time at the call 
of the Chairman or any member of the Committee promded notice of such call and the 
purpose of the meeting is given to the members of the Committee by the Secretary- 
Treasurer at least five days before the date set therefor. Should other business be passed 
upon, any member, of the Committee shall have the right to reopen the question at the 
next meeting. Committee business may also be transacted by correspondence if that 
aeema preferable. 

4. At a regularly convened meeting of the Board of Directors, four members shall 
constitute a quorum. At a regularly convened meeting of the Committee on Member¬ 
ship, two members shall constitute a quorum, 


ARTICLE V 
Publications 

1. The Annds of Mathmalical Statiatics shall be the Official Journal for the Institute. 
The Editor of the Amah of Mathematical Statiflics shall be a Fellow apjxiintecl by the 
Board of Directors of the Institute. The term of office of the Editor may be terminated 
at the discretion of the Board of Directors. 

2. Other publications may be originated by the Board of Directors as occasion arises. 


AttlTUliHi VI 

Expulsion or Subpbnbion 

action^of Of dues, no one ahaU be expelled or suspended except by 

action of the Board of Directors with not more than one negative vote. 

ARTICLE VII 
Amendments 

kriv be amended by an affirmative two-thirds vote at any regu- 

S mm Zi *'■' S<»ret.n.-Tr..,mr .1 l»,t thirty 
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BY-LAWS 

ARTICLE I 

Duties of the Officers, the Editoh, Boahd of Directors, and 
Committee on Membership 

1, The President, or in his absence, one of the Vice-Presidents, or in the absence of the 
President and both Vice-Presidents, a Fellow selected by vote of the Fellows present, 
shall preside at the meetings of the Institute and of the Board of Directors. At meetings 
of the Institute, the presiding officer shall vote only in the case of a tie, but at meetings 
of the Board of Directors he may vote in all cases. At least three months before the date 
of the annual meeting, the President shall appoint a Nominating Committee of three 
members. It shall be the duty of the Nominating Committee to make nominations for 
Officers to be elected at the annual meeting and the Secretary-Treasurer shall notify all 
voting members at least thirty days before the annual meeting. Additional nomina¬ 
tions may be submitted in writing, if signed by at least ten Fellows of the Institute, up to 
the time of the meeting. 

2, The Secretary-Treasurer shall keep a full and accurate record of the proceedings 
at the meetings of the Institute and of the Board of Directors, send out calls for said 
meetings and, with the approval of the President and the Board, carry on the corre¬ 
spondence of the Institute. Subject to the direction of the Board, he shall have charge 
of the archives and other tangible and intangible property of the Institute and once a year 
he shall publish in the Anndk of Malkemalical Slaiialics a classified list of all Members and 
Fellows of the Institu tc. He shall send out calls for annual dues and acknowledge receipt 
of same; pay all bills approved by the President for expenditures authorized by the Board 
or the Institute; keep a detailed account of all receipts and expenditures, prepare a finan¬ 
cial statement at the end of each year and prMent an abstract of the same at the annual 
meeting of the Institute after it has been audited by a Member or Fellow of the Institute 
appointed by the President as Auditor. The Auditor shall report to the President. 

3. Subject to the direction of the Board, the Editor shall be charged with the responsi¬ 
bility for all editorial matters concerning the editing of the Anndt of Mathmatical Sta- 
tistics He shall, with the advice and consent of the Board, appoint an Editorial Commit¬ 
tee of not less than twelve members to co-operate with him; four for a period of five years, 
four for a period of three years, and the remaining members for a period of two years, ap¬ 
pointments to be made annually as needed. All appointments to the Editorial Com¬ 
mittee shall terminate with the appointment of a new Editor. The Editor shall serve as 
editorial adviser in the publication of all ScienUfio monographs and pamphlets authorized 
by the Board. 

4. The Board of Directors shall have charge of the funds and of the affairs of the 
Institute, with the exception of those affairs specifically assigned to the President or to 
the Committee on Membership. The Board shall have authority to fill all vacancies 
ad interim, occurring among the Officers, Board of Directors, or in any of the Committees. 
The Board may appoint such other committees as may be required from time to time 
to carry on the affairs of the Institute. The power of election to the different grades of 
Membership, except the grades of Member and Junior Member, shall reside in the Board. 

6 The Committee on Membership shall prepare and make available through the 
Secretary-Treasurer an announcement indicating the qualifications requisite for the 
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different grades of membership. The Committee shall review these quahfications period¬ 
ically and shall make such changes in these qualifications and make such recommendations 
with reference to the number of grades of membership as it deems advisable. The power 
to elect worthy applicants to the grades of Member and Junior Member shall reside in the 
Committee, which may delegate this power to the Secretary-Treasurer, subject to such 
reservations as the Committee considers appropriate. The Committee shall make recom¬ 
mendations to the Board of Directors with reference to placing members in other grades 
of membership. The Committee shall give its attention to the question of increasing the 
number of applicants for membership and shall advise the Secretary-Treasurer on plana 
for that purpose. 


ARTICLE II 
Dots 

1. Members shall pay five dollars at the time of admission to membership and shall 
receive the full current volume of the Official Journal. Thereafter, Members shall pay 
five dollars annual dues. The annual dues of Junior Members shall be two dollars and 
fifty cents 

The annual dues of Fellows shall be five dollars. The annual dues of Sustaining 
Members shall be fifty dollars Honorary Members shall be exempt from all dues. 

(a) Exception, In the case that two Members of the Intitute are liusbnnd and wife 
and they elect to receive between them only one copy of the Official Journal, the annual 
dues of each shall be three dollars and seventy-five cents. 

(b) Exception Any Member or Fellow may make a single payment which will be 
accepted by the Institute in place of all succeeding yearly dues and which will not other¬ 
wise alter his status as a Member or Fellow. The amount of this payment will depend 
upon the age of this Member or Fellow and will be based upon a suitable table and rale of 
interest, to be specified by the Board of Directors. 

(c) Exception. Any Member or Junior Member of the Institute serving, except as a 
commissioned offioer, in the Armed Forces of the United States or of one of its allies, may 
upon notification to the Secretary-Treasurer be excused from the payment of dues until the 
January first following his discharge from the Service He shall have all privileges of 
membership except that he shall not receive the Official Journal. However during the 
first year of his resumed regular membership he may have the right to purchase, at 82.60 
per volume, one copy of each volume of the Official Journal published during the period 
of his service membership. 

2 Annual dues shall be payable on the first day of January of each year. 

3 The annual dues of a Fellow, Member, or Junior Member include a subscription to 
the Official Journal. The annual dues of a Sustaining Member include two subscrirt" inriH 
to the Official Journal, 

4. It shall be the duty of the Secretary-Treasurer to notify by mail anyone whot ' iC; ^ 
may be six months in arrears, and to accompany such notice by a copy of tliis ^ W 
If such person fail to pay such dues within three months from the date of mailing 
notice, the Secretary-Treasurer shall report the delinquent one to the Board 
by whom the person’s name may be stricken from the rolls and all privileges 1' 
ship withdrawn. Such person may, however, be re-instated by the Board K hrtB I 
upon payment of the arrears of dues. 
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ARTICLE III 
Salakies 

1. The Institute shall not pay a salary to any OlFicer, Director, or member of any 
committee. 


ARTICLE IV 
Amendments 

1. These By-Laws may be amended in the same manner as the Constitution or by a 
majority vote at any regularly convened meeting of the Institute, if the proposed amend¬ 
ment has been previously approved by the Board of Directors. 




CONTRIBUTIONS TO THE THEORY OF SEQUENTIAL ANALYSIS. I 


By M. a. Gihshick 
United Stales Department of AgrimiUure 


PART I Ai’I'licationk op Sequentiao Analysis to the Ranking op Two 
Populations With Respect to a Single Parameter. 


1. Summary. Given two populations iri and 712 each characterized by a dis¬ 
tribution density f{x, 6) which ia assumed to bo known, except for the value of 
the parameter 0, It is desired to test the composite hypothesis 0i < 02 against 
the alternative hypothesis 0i > 02 where 0, is the value of the parameter in the 
distribution density of n, (i = 1, 2). 

The criterion proposed for testing this hypothesis is based on the sequential 
probability ratio and consists of the following; 

Choose two positive constants 0 and b and two values of 0, say 0? and 0S. 

Take pairs of observations Xi„ from n and a:2a from 712, (a = 1,2, . ,), in sequence 
) 

and compute Z/ = Z) 2a where 

a*»l 


= log 


02)/(®la) 0l)_ 


The hypothesis tested is accepted or rejected depending on whether Z„ > a or 
Zn ^ ^ where n is the smallest integer j for which either one of these relation¬ 

ships is satisfied. 

The boundaries a and b are partly given in terms of the desired risks of making 
an erroneous decision. The values 0? and 0? define the magnitude of the differ¬ 
ence between the values of 0 in tti and in 712 which is considered worth detecting. 
It is shown that the power of this test is constant on a curve /i(0i, @2) = constant. 

fix, eir 


HE 


V ®/(x. ei)J 


is a monotonic function of 0, then the test is unbiased in the 


7(X. 0?)> 

sense that all points (0i, 62) which lie on the curve h{6i, 62) = constant are such 
that either every 0i < 02 or every 61 > 02. For a large class of known distribu¬ 
tions the quantity h ia shown to be an appropriate measure of the difference 
between 0i and 02 and the test procedure for this class of distributions is simple 
and intuitively sensible. 

For the case of the binomial, the exact power of this test as well as the distribu¬ 
tion of n is given. 


1,1 General discussion. C-’onsidcr two processes (populations) ttj and X2 
each yielding a measurable quantity x whose distribution density f(x, 0) is as¬ 
sumed to be known except for the value of the parameter 0. On the basis of a 
random sample obtained from each, it is desired to choose that process which 
yields the smaller (or larger) 0. That is, it is desired to devise a test which will 
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result in a high probability of accepting ri if the 0 characterizbg its distribution 
density is smaller (or larger) than the 9 in Ta, a high probability of rejecting arj 
(i.e. acceptmg n) when the opposite is true, and approximately equal probability 
of making one or the other decision if the value of 0 in iri is the aame as in xj. 

As an illustration of the type of problem here considered, let us assume that a 
manufacturer is faced with a choice between two competing processes of pro¬ 
duction, each process yielding an unknown fraction defective p and each entail¬ 
ing about the same operating cost. Based on the evidence of a random sample 
selected from each, the manufacturer wishes to choose tliat process which yields 
the smaller fraction defective. If the fractions defective in the two proiiosses 
differ by a significant amount, he will want a test which guarantees a high prob¬ 
ability of inakmg a correct decision. If, however, the fraction defective in the 
two processes are of approximately the same magnitude, it will be a matter of 
indifference to him which decision is reached. 

The solution given in this paper to the above problem is based on Wald’s 
sequential probability ratio teat [1]. The resulting procedure not only requires 
on the average, fewer observations for the same protection than any other test 
(which is always the case with sequential tests of this type) but is also direct and 
simple when applied to a large class of distributions commonly met in practice. 

1.2 Derivation of the sequential test when the existence of a priori probabili¬ 
ties is assumed. The choice of the probability ratio as a method of discrim¬ 
inating between the two processes is suggested by considerations of a priori 
probabilities. Let us assume that each process may iiave eitlier 0? or as the 
value of a parameter 8 in its distribution density and tliat the value 9? is more 
desirable than 62 , Let us further assume that there exists an a priori probability 
gi that a process will have b\ as a parameter and an a priori probability pj = 1 — 
that it will have 0? as a parameter. Lot the likelihood for n observations 
* 11 , * 12 , ■ , *!„ drawn from iri be designated by p(a;ui Zu, ■ • ■ , aJiH, 95) when 

9i is the parameter in iri, and by p{xn ,xa , • • • ,X\n,0\) when 9i is the parameter 
m TTi. Let the likelihoods p(a: 2 i, X 22 , • • • , aij, , 95) and p(xsi, xm , • • • , , «^) 

be similarly defined for n observations xii, Xjs, ■ • • , X 2 „ drawn from wj. Then 

(1.201) p(x.i, x, 2 , ■ .. , 9?) = n /(a;.-,«'/), b i » 1, 2. 

Let^.v, (f, j= 1,2),be the a posteriori probability that having obtained (a »« 

1, 2, • ■ ■ , n), that process vi has 9/ as a parameter in its distribution density. 
Then 

(1.202) = -- giP(ga,xa, • • ■ , , 95) 

gip(Xii, ,Xi„, 95) + gip(Xn, ■ ■ • , 9?) 

and 

(1.203) -- g3p(xn, • ■ ■ , Xfa, 95) _ 

gip(j-,i, ,Xi„,el) -k j;ap(x<i, • , x<„, 9?) 

for 1 = 1 , 2 . 
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In order to decide whether the hypothesis that 0? belongs to the distribution 
density of wi is more tenable than the hypiothesis that it belongs to the distribu¬ 
tion of TTj, it is only necessary to compare dn with fti • But if dii is equal to or 
greater than , the ratio dn/^ia must be equal to or greater than and con¬ 
versely . For assume that dii > Pn • Subtracting duftj from each side of the in¬ 
equality we get Piiil - jSji) > i3si(l - fin). But since 1 - dai = Pm and 1 - 0n 
= Pit, we see that Pn/Pn ^ Pn/Pn ■ Conversely, let Pn/Pn > Pn/Pii ■ Then 
j3ii(l ~ Pn) > ftid - dn), or /3ii > dn • 

From the above it would appear that a sensible sequential procedure for de¬ 
ciding whether is more likely to belong to xi than to xs is as follows: Select two 
positive quantities A and B with A > 1 and B <1. Take a pair of observations 
{xia , Jria)j (a = 1, 2, ■ ■ -)i at a time, one from each process. At each step (i.e,, 

for each sample size n) compute the ratio X = ~ . If at any stage X < B, 

terminate the sampling and accept the hypothesis that is a parameter in the 
distribution density of xi. On the other hand, if at any stage X > A, terminate 
sampling and accept the hypothesis that di is a parameter of the distribution 
density in xs. If neither hoids, that is if B < X < A, then take another pair of 
observations, consisting of one from each process. Continue this procedure 
until one or the other decision is reached.^ 

The interesting point here is that the decision function X is independent of gi 
and Pj . In fact, it is easily seen from equations (1.202) and (1.203) that 

(1 204) X = , Xm , • ■ ■, a:tn, OiMxn , »», • ■ ■ , gm, fit ) 

^ ■ P(a:n, la, • • •, Xu, xn, • • • , Xu, ^5) ’ 

1.3 The proposed sequential test as a special case of a sequential probability 
ratio test. If we examine the expression given in (1,204) we see that it is a ratio 
of two likelihoods. The numerator of the ratio is the likelihood of the 2n ob¬ 
servations under the hypothesis that el is a parameter m xi and el is a praameter 
in xa ; the denominator is the likelihood of the 2n observations under the hy¬ 
pothesis that el is a parameter in xi and 6® is a parameter in xa - Thus, the pro¬ 
posed sequential test is equivalent to a sequential probability ratio test (see [1]) 
for testing the simple hypothesis that belongs to xi and el belongs to xa against 
the alternative hypothesis that ^2 belongs to xi and el belongs to xa. We can, 
therefore, apply the theory of sequential analysis developed by A. Wald ([1] and 
[2]) to this problem. 

While the test is posed in tepis of a simple hypothesis, the solution, as will be 
shown later, is in fact a solution to a composite hypothesis. In order to bring 
this out more clearly we shall rederive a few of the results which have already 
been obtained by A. Wald. This will be done in sections 1.4,1.5, and 1.6. 

* That a decision will be reached eventually can be asserted with probability one jf the 
variance of the variate z. (defined by (1.301)) below is different from, zero (or if it is zero, 
the value of z, is different from zero), See [2], Lemma 1 As we shall see later, if, in fact, 
both processes have either 0? or flj as parameters, then the above sequential procedure will 
result in the acceptance of either process with approximately equal probability 
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In wliftt follows wc sh&U spo^k of the liypotiiosiB (0i ^ ^ 2 ) to moEO. tlic liypothosis 
that 01 is the value of the parameter in the distribution density of in and 0j is the 
value of the parameter in the distribution density of . The hypothesis (0$, 
el) will represent a specific hypothesis which we may wish to test and -will be 
used to define the decision function (the probability ratio) of the sequential teat. 

Let us fix 1 > 1 and £ < 1 and set 


(1.301) 


Zo = log 


,/(2:2.,0D/(*1-. 


where Xn, is the erth observation from a-i, aj 2 « is the nth observation from n 
and (0?. el) is the particular hypothesis to be tested against the alternative hy¬ 
pothesis ( 02 , el) Let fl = log A and ~b = log B. Then a and b are positive. 
Since the observations from n and m are assumed to be independent, log X = 

Za . Hence the proposed sequential test can be carried out in the following 

a^l 

manner. Draw one pair of observations at a time, one from xi and one from xi. 
Let zi, Z2, • • • be the values of obtained from the first, second, etc. trial. 
Let = Zi 4 - Z2 + • • + 2n, (n = 1 , 2 , • • • )• Contmue sampling as long as 
-b < Z„ < a. Whenever Z„ > o, (n = 1 , 2 , 3 , • ■ • ), terminate sampling and 
accept X2 (or xi). Whenever Z„ < —b, (n = 1 , 2 , 3 , • ■ •), ternunato sampling 
and accept xi (or X2). 

1.3a Basic assumplions. In this section and throughout this paper, wo shall 
be dealmg with sequential tests involving, as above, a decision fimction Z„ » 
2 i + 2 s + • • • + Zn, (n = 1, 2, • • • , ad inf.), where the z„’8 are independently 
distributed random variables having a common distribution fimction. Let z 
denote a random variable whose distribution is the same as the common distribu¬ 
tion of 2o,, (a = 1, 2 , ■ ■ ■, ad inf.). It will be assumed, even if not explicitly 
stated, that the distribution of z satisfies the following conditions. 

Condition i. Both the expected valve Ez of z and die variance of z exist and are 
unequal to zero. 

Condition ii. There exists a positive S such that P(e' > 1 -f- S) >0 and 
Pie' < 1 - 0) > 0. 

Condition hi. For any real value X, the expected value Ee'" = gih) exists. 

Condition iv. The first Ivio derivatives of the function gQi) exist and may be 
obtained by differentiating under the integral sign. 

1.3b. Fundamental properties of sequential tests. Let z bo defined as in 1.3a. 
Then under the assumption that the distribution of z satisfies the conditions 
specified, Wald [2] has proved the following: 

Lemma i. The probability that a decision is reached in a finite number of steps is 
unity. 

Lemma n. There exists one and only one real value h 0 such that the expected 
value Ee’'' = 1. 

Fundamental identity: The fundamental identity Pe“‘"‘[<#.(i:)]"" = 1 holds 
for all points in the complex plane for which | (/>(<) | > 1 where 4>ii) = Ee*‘. 
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Let w = log and let the distribution density of x be fix, $). Let 6 i and 

62 be any two values of d which may be distinct from 61 and dl. Then it can 
easily be verified that if w satisfies the conditions specified in section 1.3a under 
the hypothesis 6 = 61 well as the hypothesis 6 = 62 , and if moreover the ex- 
piected values of w under these two hypotheses are not equal, then a = log 


-will also satisfy these conditions when the joint distribution 

/(a;2) 6 l)fixi, 6 \) 

density of xi and Xi (xi representing the measurable characteristic in in and Xi 
in m) IS either/(xi, di) /(xj, df) or/(xi, ef) /(xa, 00. 

In what follows, we shall assume that the distribution of w satisfies the re¬ 
quired restrictions for the 61 and 62 under consideration and that the expectation 
of w under the hypothesis 0 = 0i is unequal to the expectation of iv under the 
hypothesis 0 = 6 % Consequently, we shall assume that Lemmas I and II and 
the Fundamental Identity hold for all the sequential tests we shall consider. 


1.4 The power of the proposed test. Let Xj be an observation from in and 
X 2 an observation from ira. Let 


(1.401) 


'°®/(x2,0§)/(^r,0?) 


where 0? and 02 are specified parameters in the probability density of irj, and 112 
respectively. Furthermore, let j di, 82 ) = | 0i, 02 ) be the moment gen¬ 

erating function of 2 under the hypothesis (0i, 02 ). Then 


(1.402) F(e" I 01, 02) 


r rr /(x2,0? 

J—« _^(X2 , 02 


)/(xi, 02)' 


02)/(Xl, 


/(xi, 0i)/(x2, 02) dxi dxi. 


By Lemma II there exists one and only one real number h 0 such 
that Eie''‘ | 0i, 02 ) = 1 Let Lk = P{Zn < — 2? | 0i, 02 ) be the probability that 
the sequehtial test termmates and Zn< —h under the hypothesis (0i, 82 ) ■ Then 
by Lemma I, 1 — L/. = PiZn > a | 0i, 02 ). For any random variable u consid¬ 
ered under the hypothesis (0i, 62), let the symbol Ebiu) stand for the expected 
value of u under the restriction that Zn < ~i and Eaiu) stand for the expected 
value of u under the restriction that Zn > a. In terms of the above definitions, 
the Fundamental Identity can be expressed as follows; 

(1.403) | 0i ,02)]"" + (1 - L*)F„e'^"(.#,(f 1 0i, 02 )]-" = 1- 
Setting i = h in (1.403) we get 

(1.404) -f (1 - Z,A)F„e'^'* = 1. 

Following Wald [2], we define a two valued random variable 2„ in this manner; 
= oif Z„ > oand = —bif Z„ < —b. Let — Zn = e. Then «is also a 
random variable. In what follows, we shall substitute 0 for «. The error com¬ 
mitted in neglecting «is small when 0? is close to 0^. As we shall indicate later. 
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the quantity e can, in fact, be neglected without error in the special case where 
f{x, 6) is the binomial distribution. 

Substituting 2„ for in (1.404) we get 

,(1.406) Lke~'* 4- (1 — We*" = 1- 

Solving for Lh we get“ 

1 _ 

(1-406) Lk — 1 gAo gA(a-H>) _ 1 ■ 

As we shall see later, h = Q when ^1 = 6*. But when A « 0, W in (1.406) is 
mdeterminate. However, it can be easily seen that 

(1.407) lim Lk = —^ • 

It follows from (1.406) that the power of the test is constant for all Bi and 
which give the same root t = h. The quantity h is thus fundamental in this 
test, and as we shall see later, is an appropriate measure of the difference between 
and di for a large class of distributions. 

1.6 Method of determining the sequential test. Let z be defmed as in 
(1.401) and let Ml) = £(e" 1 flj, ^5) be the moment generating function of z 
under the hypothesis {$1, ^S), and let Mi) = 1 ®j) be the moment gen¬ 

erating function of z under the hypothesis (9°, 6 i), Furthermore, let a » P(2n 
= a I 01, flS) and jS = P(Zn = -6 | flj, 0i). Then by Lemma I, 1 ~ a » P(Zk 
= -A I 01, 0?) and 1 — ^ = P(Z„ = a [ 0^, 0?). Now, applying Wald’s Funda¬ 
mental Identity we have, 

(1.501) (1 - ct)e-‘%k[Mi)r’' + «e‘“Fu[<^i(0]"" = 1, 

(1.502) + (1 - 0)e'“W[«^(Or'’ = 1, 

where the symbol Eu stands for the conchtional expectation knowing that Z„ »= o 
and Eli stand for the conditional expectation knowing that Zn =* — 6; with both 
expectations taken under the hypothesis (0?, 0$). The symbols Eu and Ea 
are similarly defined but under the hypothesis (0$, 0?). Setting f *= 1 in (1.501) 
and « = -1 in (1.602), we get, in view of Corollary 2, Theorem 2 below, 

(1-503) (1 - a)6-* + ae“ =« 1, 

(1-604) iJe" + (1 - /3)6~“ = 1. 


“ In what follows, La will always stand for the probability that a sequential test will 
terminate with Z„ <-b. In any given problem, the interpretation of the event Z^<-b 
Will be clear from the context. 
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Now a = log A and —& = log B. Hence, equations (1.503) and (1.504) become 


(1.605) 

(1 — a) B + otA = 1, 


(1.506) 

A ’ 


or 



(1.507) 

A = ^ and a = log — 

- d 

a 

ct 

(1.508) 

B = —^— and 5 = log — 
1 — a 

“ a 

d 


From (1.507) and (1.508) we see 'that the sequential test is completely determined 
by the function z, which, in turn, is defined by and d \, and by the probabilities 
of makmg a decision for the two hypotheses (0?, 0?) and ( 02 , 0j). 

Once z is defined in terms of a specific (0?, 02), the probability that Zn < —b 
will be equal to 1 — a and the probability that > a will be a (if we neglect 
the fact that | |, at a decision point, might exceed a or b) for the totality of 

hypotheses (0i, 02 ) for which the moment generating function ii>{t ] 0i, 02 ) = 1 
when 4=1. A similar statement can be made for the corresponding hypotheses 
( 02 , 0i) for which the moment generating function will equal unity when t = — 1. 
Hence, we see that while the test is defined by specifying two points (0?, 0°) 
and ( 02 , 0?) in the parameter space, the pre-assigned risks a and 5 of making 
the correct decision will be approximately constant on the set of points for 
which the moment generating function equals unity when t = 1 and when t = 
— 1, respectively. This set of points usually will constitute a smooth curve. 

d 

If 01 = 02 , Lo = —(by 1.407). Hence, the probability of accepting tti 

will be close to ^ if a is close to b, and will equal ^ if a = h. But from (1.607) 
and (1.508) we see that a = h if a — Thus, if we construct a test which 
will give a probability of rejecting tti when (0j, 0?) is true equal to the probability 
of accepting xi when ( 02 , 0?) is true, we shall be accepting tti and ir 2 with equal 
frequency when in fact 0i = 02 . 

1.6 The average number of pairs of observations required to reach a decision. 
Let Ein \ 0i, 02 ) be the expected numlier of pairs of observations required to reach 
a decision under the hypothesis (0i, 6 i ). Wc shall show that 

( 1 . 601 ) Fill 101 , 02 ) = ^SLzJi^lh . 

Proof: Differentiating the Fundamental Identity, 

(1.602) E’c‘*''[^(i)r'' = 1, 
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with respect to t, we get® 

(1.603) E{zy^''m)r - - o. 

Setting < = 0, we get 

(1.604) EZn - </)'(0)E(n j di , 0a) = 0. 

But 

(1.605) EZ„ = a(l - La) " 6La 
and 

(1.606) = Ez. 

Hence, solving for E{n | 0i, 62 ) in (1.604) and sub.eitituting from (1.605) and 

(1.606) we get 

(1.607) Bin 1 e,, 0a) = 51^-“ . 

li/Z 

While La is approximately constant for all values of (Or, 0a) for which the mo¬ 
ment generating function equals umty for I = h the oxpeeted value- of n given by 
(1 607) will depend on the particular hypothesis (0i, 0i), This follo\\-H from the 
fact that Ez is not necessarily constant for the same set of points (0i, 0*) for which 
La is constant. 


1.7 Some general properties of the proposed test. 

Theorem 1. Let z = log observaiion from 

and xifrom %2. Then if F(z) is the distribulion density of z under the hypothesis 
( 01 , 62), Fi—z) is the disiribulion density of z under the hypothesis ( 02 , 0 |). 

Proof: Let t be a real number and let = Bie'“ 1 0i, 63 ) lx; the character¬ 
istic function of z under the hypothesis ( 0 i, 63 ). Then 


(1 701) 



el)f{x3, 0°) ~ 

f{x3, ei)f{xi, ej). 


it 


/(*1 , 0 l)f(X 3 , 63 ) dxx dX 3 . 


Now let i/' 2 (t) — E{e | O 2 , 0 i) te the characteristic function of —2 under the 
hypothesis ( 02 , 0 i). Then 


( 1 , 702 ) r ,,,, ... . 

J-« L/fe, 0?)/(a;i, 0?) J ‘ • 


hterchanging the variables of mtegration in (1.702) we see that MO * MO. 
Consequently, the distribution of 2 under the hypothesis ( 0 i, O3) is the same as 


The rluTCf Identity can be differentiated with respect to <. 

See STil plge m reference to the Funflamental Identity. 



SEQUENTIAL ANALYSIS 


131 


the distribution of —z under the hypothesis {fit, ft). This theorem in con]unc- 
tion with the fact that E(z 1 ft, ft) ^ 0 when ft 5 ^ ft shows that the decision 
function z discriminates in a real sense between the two alternative hypotheses 
(ft , ft) and (ft , ft). 

Theorem 2. Let E{e“ \ di, ft) be the moment generating Junction of z under the 
hypothesis (ft , ft) and let E(e“ | ft , ft) be the moment generating function of z 
under the hypothesis (ft, ft) Then, if t = his a root of the equation E{e“ \ ft, ft) 
= 1, then t = —his a root of the equation E{e‘‘ 1 ft , ft) = 1. 

Proof: The same as Theorem 1. As we have seen in Section 1.4, the power 
of the proposed sequential test (neglecting e) depends only on h This theorem 
shows that if the probability of accepting tti is large under the hypothesis ( 0 i, 
ft), it will be small under the hypothesis (ft , 61 ), and conversely. 

Corollary 1. The only value of i for which Eie“ \ 9, 6) = 1 is t — 0 This 
follows from Theorem 2. 

Corollary 2. The values of tfor which E{e‘‘ | 0?, 62 ) = 1 a,nd E{e‘‘ | ol , 0?) 
= 1 are t = 1 and t — —1 respectively This can be seen by expressing E{e“ \ , 

9 \) as a double integral and setting 4=1 
Theorem 3 Let u be the totality of points (ft, ft) zn the parameter space for 
which ft < ft . Then a necessary and sufficient condition that the values of h {for 
which E{e'‘^ 1 ft , ft) = 1 ) lie of the same sign for all points in w is that 

(1.703) Ew\d = £ log/(a;, 9 ) dx 

he a monoionic function of 9. 

To prove this theorem we need the following lemma. 

Lemma 1. Lei g{x, 9) be the distribution density of x and tp{i) its moment gen¬ 
erating function. Let h be the real non-zero value of tfor which \p{t) — 1 . Then 
the sign of h is opposite in sign to Ex {the expected value of x) if Ex 9 ^ Q 
Proof: For any random variable u, Wald [1] has shown that the inequality 

(1 704) Eu < log Ee'‘ 

holds 

Setting u = ix, where i is a constant, we get 
(1,705) tEx < log Ee‘^ = log ^(O- 

Setting t = hm (1.706) we get hEx < 0. This proves the lemma. 

Now let E{z I , ft) be the expected value of z under the hypotheses (ft, ft) 
where (ft, ft) belongs to co. Then 

- 81 ) dx = Ew\9t. - Ew\9i. 


(1,706) 
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From (1.706) we see that ilEw\d is monotonic m d, E{z i , 6 %) will have a con¬ 
stant sign,for all pomts (fli, 62 ) in and hence by Lemma 1, h will have a constant 
sign. Conversely, if h is of constant sign for all (0], 62 ) in w, so will E{z ! 0 ,, 
62 ) be. Consequently, by (1.706) Ew [ Q must be monotonic. 

COHOLLAEY 1. Let Ew\ 6 he a monotonic function of d and let w*, (/i 0), he 

the totality of points ( 0 i, 02 ) in the parameter space for which the power of the se¬ 
quential lest is constant. Then the coordinatss of the points (0i, &i) in m arc, such 
that either every 0 i < 02 or every 0 i >• 0 *. 

Proof: By assumption all points in us have the same power. Hlnce Lh in 
(1.406) is a strictly mcreasing function of h, the points in u* must yield the same 
h. However, if we assume that m contains a point (0i, 02 ) with 0[ < 02 and a 
point ( 0 " , di) with 0 '/ > e" , the sign of Eiz \ 6 [, 0^) by (1.706)^will Ixj opposite 
to the sign of Biz \ b'I , 0"). Hence, the value of h yielded by (0l, 02 ) is opposite 
in sign to that yielded by ( 0 '/ , 02 ^), which contradicts the assumption that both 
points yield the same h 

Theorem 3 and Corollary 1 show that if Bu) | 0 is monotonic in 0, the proposed 
sequential test is unbiased m the sense that all points ( 0 i, 02 ) that lie on the curve 
h =r constant (and hence have the same power) will Irnve the property that 
either the inequality 0 i < 02 holds or the inequality 0 i > 02 holds. The equality 
sign will hold if and only if h = 0 . 


1.8 The proposed test applied to distributions which admit sufficient statis*' 
tics. Let/(a, 0) admit a sufficient estimate of 0. Then it is well known that 
/(x, 0 ) can be ivritten in the form^ 

(1.801) fix, 0) = 

Setting a = log i we see that for this class of distributions the 

/(X2, 02)/(i:i, 0i) 

decision function assumes the simple form; 


(1.802) a=[u(x 2 ) - «(i,)][r( 0 ?) - r( 0 S)]. 

" r( 0 !) - r( 0 " 2 ) = r( 0 ;) - f( 0 J) • decision function 

becomes 


(1.803) z* = u(xi) — u(xi). 

We shall now show that, for this class of distributions, the power of the sequen¬ 
tial test is a function of «( 0 i) — «( 02 ), To prove this, it is only necessary to sliow 
that Eie“ | 0 i, ffi) equals unity for ( = t;( 0 )) — y( 02 ), Now 


(1.804) 


Eie“* 1 01 , 02 ) = r r e''“''*>-''<*>»/(xi, 0 ,)/(x 2 . 02 ) dx, dxt 

•^eo •^«o 





dxi dxi . 


If we set i — r( 0 i) viBf) in (1 804), we see that the statement is proved. 


* See, for example, [3]. 
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Let En | h be the average number of pairs of observations required to reach a 
decision when v(6i) — v(ds) = h Then by formula (1.607) we have 


(1.805) 


E(n 1 h) = 


a*(l - La) - b*LH ^ (1 - Lk) log A + U log B 
Elu^Xi) — w(a;i)] hE[u{xi) — u(xi)] 


Since the expected value of uix) will not necessarily equal v{d), the average num¬ 
ber of pairs of observations required to reach a decision will depend not only on 
v(Si) — v{d 2 ) but also on the particular hypothesis (0i, 62 ) considered. 

Since the power of the teat for this class of distributions depends on v{d\) — 
v(B 2 ), it will be constant for all di and which lie on the curve defined by v(di) 
— viOi) = constant. In particular, if the sequential test is defined with risks a 
and /3, the probability of accepting in (or irj) will be approximately a for all 
hypotheses (fli, 62 ) which lie on the curve defined by v^Oi) — v^di) = v{9l) — 
vidl) = ho and the probability of accepting xa (or tti) will be approximately P for 
all hypotheses { 62 , di) which lie on the curve defined by viOz) — u(fli) = ho. 
Now, the decision function z as well as the boundaries a* and b* will be identical 
for all sequential tests provided they are defined by the same risks a and /3 and 
the parameters 61 and 62 which determine the decision function all lie on the 
curve v( 0 i) — ^(^ 2 ) = ho . Since Wald [ 1 ] has proved that the sequential proba¬ 
bility ratio teat minimizes E(n), the expected number of observations required 
to reach a decision, when the hypothesis tested is true as well as when the 
alternative hypothesis is true, it must follow that in the ease under' consid¬ 
eration E(n) is minimized for all hypotheses ( 0 i, Bi) which lie either on the curve 
defined by u(fli) — v(&i) == ?io or on the curve defined by v(Bi) — v(Bi) — ho. If 
v( 6 ) is a monotonic function of 9, then the teat is unbiased (i.e. all points (9i , B 2 ) 
which lie on the curve v(Bi} ■— v( 62 ) = constant will have the property that either 
every 9i < 62 or every Si > 62 ). 

For this type of distribution, the importance of the difference between and 
B 2 may be measured by ti(9i) — v( 62 ). We shall now show that the function 
r(fii) — ^(^ 2 ) is an appropriate measure of the difference between these param¬ 
eters for a wide class of distributions which often occur in practice. 


1.9 The proposed test applied to known distributions. 

1.9a. The problem of discnminaling between means when the variances are known, 
Let f{x, p) be a normal distribution function with unknown mean ix and known 
variance er® which we shall assume, without loss of generality, to be unity. Let 
Xi bo an observation from m and xo an observation from ir 2 . Let the distribu¬ 
tion density of xi bo designated by f{xi, mi) and that of Xo by /(xj • 112 ). The prob¬ 
lem is to decide which process has the larger n- 
Since f(x, p) is a normal distribution, it is given by 

(1.901) = 

Hence f{x, p) is of the form considered in Section 1.8 with u{x) = x and v(,p) = 
p. Therefore, the decision function is given by 

(1.902) z* = X2 — Xi 
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and the power of the test depends on A = mi “ and is given by (1.406) with, 
a and b replaced by a* and b*, respectively. 

The sequential test is performed in the following manner: We take a pair of 

n 

observations, one from tti and one from ir*, in sequence. If at any stage ^ 

a"”! 

— xic) < -b*, we accept the hypothesis that tti has tho larger mean. If, how- 
ever, at any stage X) - i:i„) > a*, we accept the hypotliesis that jra lias 

a«*l 

the larger mean. If neither holds, we continue sampling. According to aeclion 
1.8 0 * = and — 6 * = ■ ^ , where gi — gs is assumed to lx> positive. 

Ml — Mi Ml — Mi 

In order to determine a sequential test, we must fix a* and b*. That is, we 
must fix the quantities mi — Mi > A, and B. This can be accomplished by de- 
cidmg. ( 1 ) the smallest difference between the means of the two processes which 
IS considered worth detecting, This determines Ao = Mi — M? > which we sliall 
assume to be positive: ( 2 ) the maximum probability a of rejectmg the hypothesis 
that in has the larger mean when in fact mi in iri differs from ms in tj Viy as much 
as Aq , and (3) the maximum probability /3 of accepting tho hypothesis that xi 
has the larger mean when in fact the difference between ms *Lnd mi is as large as 
Ao negatively.' When a and /3 are fixed, A and B are determined by equations 
(1.507) and (1.508), 

1 . 9 b. The 'problem of discnmincUing between variances when the mcam are knoim. 
Let us assume that the distribution of Xi in in and xs in sr* are normal with known 
means but unknown variances. We are required to choose that procesa which 
has the smaller variance. Without any loss of generality wc shall suppose tliat 
the means of xi and Xj are zero. Since /(x, ir) is normal, it is given by 

(1.903) -i- 

V 2ira- 


which is of the form considered in Section 1.8 with u(x) = x’ and V((r) = —i , 

2<r 

Hence the decision function z* is given by 
(L904) z* = xl- x\ 

and the power of the test depends on A = ^( 0 - 7 “ - vf) and is given by (1.406) 
with a and b replaced by a* and b*, respectively. The sequential test ia per¬ 
formed m the following manner: We take one pair of observations at a time, one 

from iri and one from 112 . Wo continue sampling as long as X (a:L — Xj^) lies 
between -h* and a*. Whenever X(a;L - xlf) > a*, we conclude that <rt > v?. 

at“l 


,,T ' defined by (1 406) is a monotonie function of A - . Hence the 

probabilUy of rejecting the hypotheeie that x, has the larger mean is ^ « whenever m. - M» 

fllateme Jt rn“h risk of making an erroneous deoision. A similar 

siatement can be made concerning the nskM. 
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Whenever 2 — ^L) < —b*, we conclude that o-j < a\. 

a»l 

o* and h* are defined by 




and 


-h* = 


log-g 




The quantities 


Thus a* and b* are defined by a specific value of o-j “ — o-i ^ and A and B. If Ave 

2 1 S 

take (cS) ^ — (ffi) ^ as negative, then A = - -and B — - - where a = proba- 

1 — q ; a 

bility of concluding that tr? < (s\ when in fact <j 2 ^ — (rr“ = — [(uj)"* — (cr?)”*] and 
/3 is the probability of concluding a\ < al when in fact = [((rS)““ — 

1 9c. The problem of discriminating between variances when the means are un¬ 
known. Let the measured characteristics in in and irj be assumed to be normally 
distributed with unknown means and unknown variances. We desire to choose, 
on the basis of a sequential test, that process which has the smaller variance no 
matter what the means are. This will be accomplished by reducing the problem 
to that treated in Section 1.9b. 

Let Xn , xn , Xn , • • ■ be the successive observations from iri and X 21 , Xti, x^ , 
• • • the successive observations from ira. Consider the transformation 


1 1 
yn = ^^Xn - 

11 2 


2/Un-l) 


1 1 n ~ I 

■\/n{n — 1) VnCn — 1) -s/nln — 1) ' 


with yw, yn, • • • J/sin-u ■ • • similarly defined in terms of aiji, * 22 , • • • a: 2 „ • • • . 
It is obvious that this transformation can be applied sequentially. Moreover, 
it is easy to show that 

(1) The expected values of the y’s are zero. 

(2) The variances of the y’a are the same as the variances of the k’s. 

(3) The y’s are normally and independently distributed. 

Hence we can apply the sequential test developed in Section 1.9b to the y'a 
without any alterations. The decision function Z* will be given by 

Z*n = E {yl - yla). 


(1.905) 
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But it can be easily shown that 


11 yla = (Xia — is)* 


llvu=il (a:ia - ii)* 

a»l a*-l 

where Hi and £2 are the arithmetic means of the observations in n and xj respec¬ 
tively. Hence (1.905) is equivalent to 


(1.906) 


n+l »*4*1 

Z* = 11 i^c — is)* — S (a;i« — ii)* • 

a—l <»"l 


Thus, to perform this sequential test, the population means need not be known. 
The only difference between the tests considered in 1.9b and 1.9c is that 1.9e 
requires one additional pair of observations.* 

1.9d. The 'problem of discriminating between means when the variates have a 

Poisson distribution. Let the distribiltion of in xi be given by-r— and 

Xil 

the distribution of X 2 in jtj be given by-j—• where Xi and xj each take on the 

X 2 I 

values 0, 1, 2, • • • . It is desired to test the hypothesis that the mean in xj 
is smaller than the mean in X 2 against the alternative that the reverse is true. 
Since the Poisson distribution can be written as 


(1.907) 


/(x,m) 


it is of the form considered in Section 1.8 with u(x) = x and v{m) = log m. 
Hence the decision function z* is given by 


z* = Xj — Xi 


and the power of the test depends on A = log —. The sequential test is per¬ 
formed in the following manner: We take one observation from n and one from 

ft 

in succession. If at any stage H (xa„ — xib) < — h*, we conclude that m 2 

n 

is smaller than mi. If 2 (^la — Xia) > a*, we conclude that Mi is smaller than 

f 

m 2 . If neither holds, we take another pair of observations. This process is 


• The method employed here was discovered independently by Charles Stein and the 
author as a solution to a different sequential problem 
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continued until one or the other decision is reached. The quantities o* and h* 
are given by 


(1.908) 




log Ua 


(1.909) 


h* = 


log 


1 - d 


a 


log ^^0 


where uo = m\/m\ which is assumed to be less than one, a. is the desired proba¬ 
bility of concludmg that Wj is smaller than mi when in fact ml/mt = Wo < 1, and 
/3 is the probability of concluding that mi is smaller than mz when in fact 
my ml — 1 /uq . The power curve is given by 


(1.910) 




- 1 ’ 


where u = mi/mj. 

1 9e. Double dichotomies!' We are given two processes n and 7r2, one yielding 
a fraction defective pi and the other pj. We sliall assume that pi and p 2 are 
unknown. We desire to choose on the basis of a sample that process which gives 
the smaller fraction defective. That is, we wish to devise a test which gives a 
high probability of accepting xi if pi < pa and a high probability of accepting 
n if p 2 < Pi. If Pi = P 2 , we might be more or less indifferent as to which 
process we select, 

Before we can answer this question, we must decide: (a) the minimum differ¬ 
ence between the two processes which we consider w'orth detecting; and (b) 
if the two processes differ at least by the amount specified in (a), the minimum 
probability mth which we desire to make the correct decision. 

In the proposed test, the decision function is given by z* = xt — Xi where 
Xi, (i = 1, 2), takes on the values 0 or 1, depending on whether the fth process 
yields a nondefective or defective item. The difference between the two proces¬ 
ses is measured by* u = (the ratio of the odds). It can easily be 

1—Pi 1—P2 

seen that when « < 1, pi < pi and when u > 1, pi > pj. If u = 1, pi = pa, 
Let Uo represent a quantity loss than 1. Furthermore, let a be the probability 

of accepting itj when in fact the point (pi, pz) lies on the curve —* == Uo; and 

qipt 

/3 be the probability of accepting vt when in fact the true point (pi, pj) lies on the 


'For a solution of a more general problem'in double dichotomies using a different 
approach, see [1], section 5 32 and [4] sections. 

* This follows from the fact that the binomial distribution can be written as /(z, p) = 
ei>o*(p/8)+iDjj where x takes on the values 0 or 1. Hence the distribution is of the form 
considered in section 1.8 withefp) = log p/g, w(p) = log g, and 2 * = ij — xi. 
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V2QI 

curve — = -Wo, 
32P1 


Once Uo, « and p arc chosen, we compute 


(1) a* 


1 ^ 

'°e—; 

log-Uo , 


and 

1 1 ~ P 
(2) -f'* = ' • 

log 1/0 

We then proceed as follows: We take one item from each process in sequence 
and cumulate the number of defective di in process wi and da io process . 
Whenever da — di < —b*, we choose process Tra. Whenever dt — di > ct*, 
we choose process tti . Whenever da — di lies between a* and b*, Vi'q take 
another pair of observations, one from each process. This procedure is con¬ 
tinued until one or the other decision is reached. 

1.9el. The exact value of the •power function for double dichotomies. Sinen 
da — da changes at most in steps of one unit, it must follow' that whenever a de¬ 
cision is reached at a*, the difference between a* and da — di ia cither zero (if 
a* is an integer), or the difference between a* and da ~ di is constant for all 
values of 71. A similar argument holds for b*. This permits U8 to compute the 
power function without any approximations. Let d be the next positive integer 
larger than a* if a* is not an integer, and d = a* if a* is an integer. Ixjt 6 lie 
the next positive integer larger than b* if b* is not an integer, and 6 “ b* if b* ia 
an integer. Then we see that the equation (1.406) for the power curv'e con Vie 
given without any approximations by the formula 

(1.9101) L« = (ii*+‘ - - 1) 

19e2. The exact average sample number for double dichotomies. Ixit Zn — 

da — di and let the pomt (pi, pa) be on some curve = u. Let E(n\pi,‘pi)he 

Pifli 

the expected number of pairs of observations required before a decision is reached. 
Let Lu = probability of reaching —5 (i.e., L„ is the probability that t* is ac¬ 
cepted). Then 1 — Lu is the probability of reaching d (i.e., 1 — I/„ is the prob¬ 
ability that TTi is accepted). Then by Wald’s Fundamental Identity we have® 

Cl-911) EZ„ = EsE{n | pi, pa). 

Now, F2 = Pa - Pi, and = -L„6 -f- (1 - L„)d. Hence 

(1.912) E{n 1 Pi, pa)'= 

Pi - Pi 

•Por a derivation of formula (1.911) which does not depend on the Fundamental Identity, 
see Wald [1], page 142. 
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It will be noted that while Lu depends only on u = , E{n \ pi, ps) depends not 

piqi 

only on the ratio of the odds but also on the difference between the two fraction 
defectives. 

1 9e3. The distribution of nfor double dichotomies. In this section we shall be 
concerned with the probability of reaching a decision with exactly n pairs of 
observations. 

Let a and h be two positive integers and let the sequential test be defined by 

n 

the decision function Z„ — ^Za where Za takes on the values —1, 0, and 1 with 

a-l 

probabilities Pi, Pi, and Pj, respectively. In terms of double dichotomies, 
Z* = (U — di where and di are the cumulative number of defectives obtained 
sequentially from ti and n, respectively, and Pi = piqi, Pi = fipi + gigs, 
Pa = psgi, where pi is the fraction defective yielded by iri and pj the fraction 
defective yielded by . 

By the Fundamental Identity we have for any t in the complex plane for which 

I 4>H) I > 1, 

(1.913) L,.e~‘%m)r + (1 - Ln)e‘*Ei[m"' = 1 

p 

where Lu is the probability that Z* = —b when pi and pt are such that ■—==«, 

Jri 

El and Ei are the appropriate conditional expectations, and 

(1.914) “ Pie"' + P 2 + Pac'. 

If we examine Wald’s proof of Lemma II [2], we see that <#>(t) > 1 for all real 
values of t which lie outside the open interval (0, h) where h is the root of the 
equation *= 1. Hence, it must follow that the Fundamental Identity (1 913) 
must also hold for all real values of I with the possible exception of the open in¬ 
terval (0, h). This fact will be used in the subsequent discussion. 

We slisll first obtain the distribution of n when o = <». From equation 
(1.910) we see that when a approaches », L„ approaches 1 for w > 1 and for 
tt < 1. We shall assume that u > 1. Then for t negative and o = w, the 
Fundamental Identity (1,913) becomes 

fl.giS) e-*^E[d>(t)]-” = 1 

or 

(1.916) Emr = 

Now for all u > 1, Pi > Pa, and hence Ez = Pt - Pi is negative. Since the 
real roots of ^(t) 1 are opposite in sign to Ez, it must follow that (1.916) holds 

for all t in the interval (— 00 ,0). Now set e' = x. Then (1.916) can be written 
as 

EiPi - + Pi+P^T" = 

X 


(1.917) 
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and (1.917) is valid for all x in the interval 0 < i < 1. 
Now set 


(1.918) Pi 1 + Pz + P^x K 

tC T 

Then for any specified value of r there will be two values of x, say Xi{t) and Xz(r). 
As T approaches 0, one of tliesc values of x will approach zero and the other 
infinity. Let xi(t) be the value of a: in (1.918) which approaches zero as r 
approaches zero. Substituting (1.918) in (1.917) we get 

(1.919) Sr” = [x(r)t 

But Et" is the generating function of n. Hence if we could expand .Er" as a 
power series in r, then the probability E* = — b in exactly n steps would be given 
by the coefficient of r". We are thus led to consider the expansion of [x(t)]’’ 
in a power series m t. 

We multiply (1.918) by rx and get 

(1.920) X = t(Pix^ + PiX + Pi). 

Then since Xi(t) approaches 0 as r approaches 0, we can expand [ji:i(r)]^ liy I>a> 
grange formula/** and get 

(1.921) [xi(r)]‘ = i: ^ (r‘(Pi + p= f + p. f’)"’JM 


where the expansion is valid for ii(r) sufficiently close to zero, Hence, if P»(b) 
IS the probability that exactly n pairs of observations are required to reach a 
decision, then 


(1.922) P„(b) = A + P, f + p, . 

Now 


(1.923) 


But 


dj"-* 


[t\Pi + P2i + Pzfri(-o 


= T P< y (n - z)I 

— t)l * ^jl(n — i — j)i 


p,pn-W 




wn+tf— 


(1.924) 

unless n = n -h 

(1.925) 


— £' 


n+f—y+6—1 






= 0 


— y + 6, i.e., y = f + 6, in which case 
d"-' 


dr--^^ 




t-0 


= (n - 1)1 


" See, for example, Mathematical Analysis, Vol. 1 (paragraph 189), by Goursat-Hedrlok. 
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Also, since the subscript j ranges from 0 to n — i, it must follow that j <n — i. 
Hence, f + b < n ~ i, or f Substituting (1.924) and (1.925) into (1,923) 

and simplifying, we get for P„(b) 


(1.926) 


P»(b) 


L (ft - l)!H(+‘Pr“"^P* 
^am+ 6 )!(n - 2» - 5)1 


where m « when n — 5 is even and m = -—^^ when n — 6 is odd. 

We shall now obtain the distribution of n when a is finite. 

As before, let Xi(r) and aJj(r) be the roots of the equation (1.918). Then from 
(1.913) we have 

(1.927) L«[Xi(7-)]~*Pir'' + (1 - LJ) [xa(r)]“P,r" = 1, 

(1.928) + (1 - L«) fe(r)]“P»r’‘ = 1. 


Solving for Eir" and Eit” from (1.927) and (1.928) we get 


(1.929) 


r j? " - [ai(r)a?a(T)]‘’[a:ii(T)‘* - a;i(r)'*] 
^^(r)*^ - Xi(t)«^ 


(K930) (1 - Eu)EiT * _ a:,(T)«-^* ' 

We shall first obtain the probability Q«(5) that Z* = -b. This is given by the 
coefficient of r" in the expansion of LuEit” in a power series in r. From (1.918) 

P 

we see that Xi(r)X 3 (r) = ^ . Hence we can write (1.929) as 


(1.931) 


LuEir" 


1 - a:i(r)“+^ 

V 1/ 


Applying Lagrange formula, we get for Q»(5) 

(1.932) Q»( 6 ) « ^ [(Pi + P« f + P 3 t’)’‘/'(t)]M 


where 



(1.933) 


/(() = 
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But /({) can be expanded in a power series in 

(1.934) «{) = g(pj « -{pj « 

Hence 

1 * /P \ **'■*■** 

Q^(b) = i L [(2fc' + l)b + 2fca] j 

(1.935) „ /p \ tb4-(JHl)n 

- A L [(2fc + l)b + (2fc + 2)a] f pM 
n\ k-a VI/ 

. (Pj + Pjf + P3t’‘)'’lE„n. 

df"-! 

Comparing (1.935) with (1.922) we see that 

(1 936) Q„(6)sPn(5) - Pn(& + 2a) + P«(3b + 2a) - • • • , 

the terms in the series being alternately of the form 
f5Y'’+*'’ P„[(2fc + l)b + 2fca] and 


- ^gy‘+<'=+«‘p„[(2fc + 1)6 + (2fc + 2)a], /or = 0. 1, ■ ■ • 


The series stops by itself as soon as the argument of P„ becomes greater than n 

If we compare (1.930) with (1.929), we see that the probability that Z* = a 
with exactly n pairs of observations is given by (1.936) with a and b interchanged 
and the result multiplied by (Ps/Pi)". 

It is to be noted that the problem of double dichotomies is similar to the fol¬ 
lowing problem in games of chance. Tavo players A and B, possessmg o and 6 
dollars, respectively, are playing a game of chance which admits a draw. The 
stake is one dollar per game. The probability that A will win one dollar is 
Pi, the probability that B will win one dollar is Pi and the probability of a draw 
is Pi . In terms of this game, L„ given by (1.910) is the probability that B 
Avill be ruined in the long run, and Qn{b) in (1.936) is the probability that B Avill 
be ruined in exactly n games. 

For a discussion of games of chance which do not permit a draw, see Introduc¬ 
tion to MathematicalPrubahility, Chapter VIII, by J. V. Uspensky. The develop¬ 
ment presented above is in some respects similar to that given in Uspensky’s 
book In Part II, Ave shall give a different and more general approach to the 
problem of derivmg the distribution of n for sequential tests m which the variate 
takes on a finite number of integral values. 
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AN APPROACH FOR QUANTIFYING PAIRED COMPARISONS AND 

RANK ORDERS 

By Louis Guttman 
Cornell University and War Department 

1. Summary. Research for the Army demobilization point system evolved 
a new approach to paired comparisons and rank order. Each of N individuals 
compares or ranks n things, the problem is to determine a numerical value for 
each of the n things that will best represent the comparisons in some sense. The 
new criterion adopted is that the numerical values be determined so as best to 
distinguish between those things judged higher and those judged lower for each 
individual. Least-squares is employed m the analysis, and the solution appears 
in the form of the latent vector associated with the largest root of a matrix ob¬ 
tained from the comparisons or rankings. 

This approach applies to the conventional problem of ordinary comparisons, 
the numerical solution being easily obtainable by simple iterations; the conven¬ 
tional use of hypothetical variables and unverified hypotheses is avoided. The 
Army point system is an example of a new and more complicated class of prob¬ 
lems; the same principle for the solution applies here, only more details occur 
in the derivations and computations. 

2. Introduction. The problem of paired comparisons arises when it is desired 
to obtain numerical values for a set of a thmgs, with respect to one characteristic, 
such that these values will represent the judgments of a population of N in¬ 
dividuals. 

One procedure for obtaming the judgments is to have the individuals compare 
the things two at a time and to judge for each comparison which of the two 
things should be given the higher rank. An alternative procedure is to have 
each individual rank all the n things simultaneously. Such a ranking implies 
judging all the n(n — l)/2 comparisons at once; hence, the two procedures are 
substantially equivalent. Two noteworthy differences between the procedures 
are: (a) comparing two things at a time allows inconsistencies to appear within 
judgments of an mdividual, and (b) it is sometimes harder in practice for people 
to judge n things simultaneously than to compare them two at a time. 

The problem of quantification, of course, is identical for both procedures, so 
we do not distinguish between them in this paper. The judgments vary from 
person to person (and possibly within a person), and the problem is to determine 
a set of numerical values for the things being compared that will in some sense 
best represent or average the judgments of the whole population, 


‘Adapted from Report D-3, “An approach for quantifying paired comparisons,'’ Re¬ 
search Branch, Information and Education Division, Headquarters Army Service Forces 
Washington, D C , 1945. ’ 
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In some situations, the things bemg compared may be single items or objects, 
this we shall call the case of ordinary comparisons. In other situations, the 
things may be comhinaiims of items or objects. 

This paper is devoted to the presentation of a general approach to quantifying 
comparisons or rank orders, with particular application to ordinary comparisons 
and to the comparison of combinations of two things It seems to differ from 
previous approaches in at least two important respects: (a) it is based on but one 
simple principle, namely, that the quantification shall be the one best able to 
reproduce the judgment of each person in the population on each comparison; and, 
as a consequence, (b) the approach yields solutions not only to the traditional 
case of ordinary comparisons, but also to more complex cases that do not seem 
to have been discussed previously. 

An example' of a major practical use of this approach is wiyi respect to the 
demobilization score card of the United States Army. The problem was to 
determine the number of points to assign each of the variables on the score card 
accordmg to the opinions of the soldiers themselves. The research on this was 
based on a form of paired comparisons more complicated than the ordinary one, 
and had additional complications of curvilmearities of various sorts in the data. 
Our approach handles such problems as well as the problem of ordinary com¬ 
parisons. 

Let us describe the score card problem in somewhat more detail. In a survey 
of enlisted men throughout the world by means of a questionnaire administered 
by field teams of the Research Branch, it was found that there were five variables 
that the men thought should receive consideration on the score card to determine 
order of demobilization: length of time in the Army, length of time overseas, 
amount of combat, age, and number of children. 

The problem now was to determine how much weight to give eacii of these 
variables in obtaining total scores. According to ordinary paired comparisons, 
one would ask, for example, “Who should get out first after the war; a man 
who has two children or a man who has been in two battles?’’ But respondents 
refuse to judge such a comparison because the battle experience of the first man 
is not specified, nor is the number of progeny of the second man, so that there is 
insufficient basis for judgment. 

Therefore, in the actual research, judgments were asked on each of ten com¬ 
parisons put in the following form: 

“Here are three men of the same age, all overseas the same length of time. 
Check the one you would want to have let out first: 

-A single man.... through two campaigns of combat 

-A married man with no children .... through one campaign of combat 

-A married man with two children ... not in combat.” 

Each variable was compared with every other one in this fashion. 

The equations were derived for computing the relative number of points to 
assign to each month in the army, each month overseas, etc., which would be 
most consistent according to our principle. These are essentially the equations 
developed in section 6 of this paper. 
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The results showed strong curvilinearities in the men’s judgments. Amount 
oi comhat received one amount of emphasis when compared with age, and another 
amount of emphasis when compared with number of children. Since the score 
card would be too complicated in practice if curvilinear scoring were used, 
equations were derived for the linear scoring scheme that would be most con¬ 
sistent according to our principle. These are essentially the equations derived 
in section 7. The weights arising out of the research were computed from such 
equations. 

The variable age received a slight negative weight, which justified dropping 
it from the score card. The weights the Army finally adopted for the i^maining 
factors were modified from the research weights, hut yield essentially the same 
results as the research weights. Demobilization scores obtained from the one 
system of weights correlate very highly with scores obtained from the other. 

It can now be revealed that the Army’s modification was essentially to reverse 
the weights for children and battles. In subsequent attitude surveys on how 
well the soldiers liked the point system [8], a major complaint was found to be 
that battles got too little weight compared with babies I 


3. The basic principle. Our basic principle in deriving numerical values—let 
us call them “aj-values”—for the things being compared requires that the a;- 
values of things a given person judges higher than other things should be as 
different as possible from the i-values of the things he judges to be lower than 
other things. This will be achieved if we make the x-values of tilings judged 
higher as homogeneous as possible among themselves, and the i-values of things 
judged lower as homogeneous as possible among themselves, for each individual. 
In the language of analysis of variance, our principle calls for minimizing the 
variation'within individuals, compared with that within the group as a whole.* 
The resulting i-values will tend to be the best for reproducing the judgment of 
each individual on each comparison with a minimum overall proportion of 
errors of reproduction [3, pp. 342-343]. The smaller this overall proportion of 
error, the better the quantification represents the data. Least squares is used 
for convenience for measuring variation in deriving the equations. 

The previous literature, on ordinary paired comparisons,* seems to liave 
concentrated largely on the problem of estimating the differences between means 
of hypothetical variables assumed to underlie the judgments. Thurstone has 
shown that by using assumptions of normality of distribution, equality of vari¬ 
ances. and zero correlations among hypothetical variables, it is possible to 
estimate relative distances between means for some kinds of data. 


™ suggested by previous work on scale analysis; 

see [31 This theory has been developed further by the definition of a perfect scale in 
equations for the perfect scale have interesting properties that may be related 
to paired comparisons, these equations are being prepared for publication. The referees 

have called my attention to related work on quantification by R A. Fisher in [1, p 283] 

217-^T FnrT^ the previous work, including that of Thurstone, is given in [2, pp. 
zii ji43I. lor more recent work, see [7] i 
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The problem of estiinatuig differences between means is not identical with 
that of reproducing individual judgments. For example, it can be shown, 
within the same framework of hypothetical variables conventionally used, that 
if variances are unequal and/or coirelations are unequal then the means of the 
hypothetical variables are noi in general the best quantification for reproducing 
individual judgments; the principal axis of certain product-moments of raw 
scores is the best quantification. It is m the special case where variances are 
equal, and where correlations are equal—^not even necessarily equal to zero— 
that the principal axis is the set of means. Proof of this is given in the appendix. 

The approach of this paper does not use hypothetical variables, but inquires 
directly as to what numerical values can be derived from the observations that 
will beat reproduce those observations. 

In the next section is treated the case of ordinary comparisons. The more 
complicated problem of the demobilization score card is formalized in section 5, 
and the equations for its unrestricted solution are derived in section 6. Since 
the unrestricted solution brings out curvilmearities that may be present, and 
since the score card in practice required a linear scoring scheme, equations for 
the most consistent linear quantification are derived in section 7. These are 
essentially the equations used in the research on the weights for the score card. 

The appendix shows a distinction between the conventional principle of 
estimating mean differences of hypothetical variables and the present principle 
of representing the comparisons of each individual. 

4. The case of ordinary comparisons. Paired comparisons as treated in the 
literature seem concerned largely with the ordinary case where separate things 
arc compared, rather than where combinations of things are compared. Our 
principle covers the ordinary case as well as more complex cases, and we shall 
treat the ordinary case first since it involves less details. 

Let 0i, , ■ • • , 0„ be the n things to be compared, where the assigning of 

subscripts is arbitrary. Each of N individuals is asked to make judgments of 
the form that 0, is higher than (or lower than) Ok . For convenience, we assume 
the rules of the experiment to exclude judgments of equality. We shall also 
assume that all people compare all the pairs. Hence, there are N sets of n(n — 
1) /2 comparisons. Considering each comparison as comprising two judgments— 
one of “higher than" for one object and one of “lower than” for the other—^there 
is a total of Nn{n — 1) judgments in the experiment. 

The judgments of all the individuals on all the comparisons can be represented 
compactly as follows. Let 

1 if individual i judges 0, > 0* 

(4.1) Biik = 0 if individual % judges 0/ < 0* 


Oy = fc. 

The ranges of subscripts, whether free or dummy, will always be; 
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(4.2) 


i = 1,2, ,N 
j,k = 1,2, ■■■ ,n, 


so that the ranges will not be explicitly stated again. 

Definition (4.1) implies that if e.,* = 1, then e,it/ = 0, and that 

(4.3) e,jk + Cikt =1, (j 9^ k). 

Let f,j be the number of things individual i judged to be lower than Oy, and 
jet g,, be the number of things ha judged to be higher than Oy. Then 

(4 4) fij s X) a.yfc, flu ^ £ eyfc, . 

h k 

From (4.3) and (4.4), we have 

(4.5) fii g,, = n ~ 1. 

Let F be the total number of comparisons made by each person; then 
(4 6) F = n(n-iy2^'Zf,k^j:g.k. 

k k 

Let c be the number of times each Oy was judged in the whole experiment, and 
let C be the total number of judgments in the experiment: 

(4,7) c = f^(n - 1) ^ E (/y, + ff.y), C = Nnin - 1). 

Both c and C count each comparison as two judgments, one of “lower than’’ 
and one of “higher than.” 

The means and variances to be considered are defined as follows. Let X/ 
be the numerical value to be derived for Oy on the basis of the comparisons. 
Let J, be the mean of the x-values of the things individual i ranked higher than 
the other things, weighted by the respective frequencies of the judgments, and 
et y, be the sum of squares of deviations from their mean of these a:-values: 


S-8) 

t h 

fli = Z (a:* - = Z xl},K - t; F. 

k k 

Similarly, let u, and a.- be the mean and sum of squares respectively for the x- 
values of the things individual i ranked lower than other things: 


<«») 

P K 

(4.11) ® ? (='* “ fl-* =T,xl ga - u? F. 

* k 


Let y be the mean of all the x-values in the experiment, and let W be the 
or squares of deviation from their mean of the a;-values ■ 


sum 


( 4 . 13 ) W = Zixk-Vfo=c2:xl-rc. 

* k 
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W is the total sum of squares for the experiment. Let J? be the sum of squares 
between individuals, amd let S be the sum of squares within individuals; 


(4.14) R [(«.• “ Vf + (t/i - Vf]F = (1? + u^.) - V^C, 

» { 


(4.16) 5 = L (l/« + a,) = TT - K. 

I 


Our principle is to quantify the judgments by obtaining the i-values that will 
minimize the variation within individuals compared to that of the group as a whole. 
This means making S as small as possible compared with W, which is equivalent 
to making R as large as possible compared with W. 

Therefore, if we define the correlation ratio E by 

(4.16) £' = 1 - <s/Tr, 

the problem is to determine the Xf that will maximize 
A convenient formula for E^ is, from (4.16) and (4.16), 

(4.17) E* = R/W. 

Since Ef is invariant with respect to translations of the a:-values, we can without 
loss of generality set 

(4.18) 7 = 0. 


Then we can write from (4.14) and (4.13), respectively, 

(4.19) fi; = (<* + u\) 

% 

(4.20) W = cT, 4. 

k 


To find the maximizing values xi for E^, we differentiate the right member of 

(4.17) with respect to the x/, set the derivatives equal to zero, and obtain the 
stationary equations 


(4.21) 


.dW 

T~ = ^ 
dx, dx/ 


The derivatives of R can be evaluated by differentiating the right member of 
(4.19) with the aid of (4.8); 

S2i 2 

(4.22) ^ S »» 2 + gt, gn). 

From (4.20), the derivatives of T7 are 
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then (4.21) ca.n be re-written, from. (4.22), (4 23), and (4.24) as. 

(4.25) E =-S’®) • 

Equations (4.25) are the equations to be solved numerically for the maximizing 

Before indicating a procedure for the numerical solution, let us first verify 
that a solution of (4.25) will satisfy (4.18). Summing both memlxirs of (4.25) 
over j, and using (4.24) and relations among the notation previously defined, 
we get 

A- ; 

or, from (4.12), 

(4.26) (1 - E^)V = 0. 

Therefore, if E'‘ 1, we must have 7=0. Since a perfect correlation ratio 

will not in general occur in practice, condition (4.18) will in general be satisfied 
by a solution of (4,25). 

There is always a trivial solution of (4.25) for which is formally equal to 
unity. This is x, = 1. For this trivial solution, li = Ui ^ 1\ R = ]V ~ C\ 
jE° = 1; and (4 25) is satisfied, Of course, E is not an actual correlation ratio 
for this trivial solution. 

The non-trivlal solution of (4.25) can be carried out with the aid of matrix 
algebra. Let x be a row vector of the nelements Xi , and let Hl)e the nXn sym¬ 
metric matrix l|ff/j,l|. if is not only symmetric but Gramian, since its ele¬ 
ments are product sums. Now (4.25) becomes the matric equation 

(4.27) xH = E^x . 

Equation (4.27) shows that x is a latent vector of H, and E^ is a latent root to 
which this vector corresponds. Since we want the largest possible correlation 
ratio, we seek the largest of the non-tnvial roots. If the two largest non-trivial 
roots are not equal, which should be the general case in practice, then there is a 
unique vector associated with the largest root which is the solution to our 
problem. 

The numerical solution of (4.27) can be carried out by the simple iterative 
technique for latent roots and vectors (see, for example [0]). The iterations 
converge m general to the vector associated with the largest root. To avoid 
convergence to the trivial solution (which formally has the largest root), the 
trial vectors should be adjusted to satisfy (4.18), then they will converge in 
general to the vector associated with the largest non-trivial root. 

A good way to choose a first trial vector is first to guess what the rank order of 
the x-values will be. Let T; be the guessed rank of X;, the Tj comprising the 
integers from one to n. If n is odd, then as the first trial x, use r, — (n -|- 1) /2, 
If n is even, then as the first trial Xj use 2 r, — n — 1. 
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A marginal check on the internal consistency of the judgments of the popula- 
tim is to c(^pare each difference {x,- — Xk) with the corresponding difference 
~ ^ C|jr,). If the population’s judgments are sufficiently consistent, 

the signs of the two differences will be alike for all the comparisons. 

is the frequency with which 0, is judged greater then Ok, and can be used as a 
basis for guessing the ranks of x, and a;*. 

6. Comparing combinations of two things. The problem of the score card is 
but one example of a class of problems that can be formalized as follows. Con¬ 
sider a set of n items, where the ^th item has m, categories. Let 0,p be thepth 
category of the jth item, (p = 1, 2, • - • , m,; i = 1, 2, • ■ • , n). The 0,p may be 
either qualitative or quantitative, and the order of subscripts assigned the 
categories can lie arbitrary. 

Each of N individuals is asked to make judgments of the form that the com¬ 
bination (0,p, Okr] is greater than (or less than) the combination (0,g, Oj,,). 
We shall assume that all people compare each of the pairs of combinations, and 
that the rules of the experiment exclude judgments of equality. 

The judgments of all the individuals on all the comparisons can be repre¬ 
sented compactly as follows. Let 

1 if individual i judges (0/p, Oir) > (0„, Ok.) 

0 otherwise. 

Here and throughout this paper the ranges of subscripts, whether free or dummy, 
will always be as follows: 

f = 1,2, ... 

(5-2) A: = 1, 2, ... , n 

P) 9) 1", s = Ij 2, •.. , w,, (or rtik, as the case may be), 

BO that the ranges will not be explicitly stated again. 

Definition (5.1) implies the symmetry 

Ciifc/jjriji = ^ik,/rp,tq j 

and that 

0 if individual i omits the comparison of (0„ , 

. _ Okr) with (0„ , Okd 

1 if he judges these two combinations to be 
unequal. 
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Additional notation is defined as follows. Let o,i,k/pr bo the number of com¬ 
binations individual i judged to be lower than {Oip , Okr)i and let b,,k/pr be the 
number of combinations he judged to be higher than (Ojp , Otr): 


(5.5) 

(5.6) 


flijt/jjr = 2 ^2 — Oikj/rp 

a • 

btjk/pr — £ 5^ ®ti*/a»rpr — b,icj/rp • 


Let Cik/pr be the number of comparisons for all individuals involving (Ojp , Okr): 
(5.7) C,k/pr = S ((itik/pr + b„kjpr) — Ck,/rp ■ 


Let Lfpbe the number of times that Oj„ occurred in combinations that were judged 
to be higher than other combinations by individual i, and let ^, 7 p be the number 
of times 0,p occurred in combinations judged lower than others: 

(5.8) /»)P = “ S 2 ^ikj/rp , 

k r k r 

( 5 * 9 ) ffijp = ^2 ^2 btjk/pr — ^2 biki/rp • 

hr k r 


Let be the total number of times in the entire experiment that Oip was 
judged: 


(5.10) 


Xj {ftir "b 0 Wp) ^ £ Xj ®)*/pr 


Let F be the total number of comparisons made by each person, and let C be 
the total number of judgments in the entire experiment (a comparison com¬ 
prises two judgments, one of “higher than” and one of “lower than”); 

(5.11) F ^j:Zh,p^l2I2onp, 

ip IP 

(5.12) C = Z £ Art = 2AiF, 

i p 

The means and variances required for the problem are defined as follows. 
Let irt be the numerical value to be derived for O/p from the judgments. Let 
U be the mean of the i-valuea of the combinations individual i judged to be 
higher than other combinations, weighted by the respective frequencies of such 
judgments, and let w, be the analogous mean of combinations judged lower than 
others; 


(5.13) s 4X) E X) X) (a:,, + a:*r) o,,*/pr ^ | E E Xkrfikt, 

^ 1 k p r r k T 

(5 14) W. = i E E E E {x,p -b a:*.) b.jk/pr ^ | E E Xkrgtkr . 

Let yi be the sum of squares of deviations from their mean of these “higher 
than” i-values, and let z, be the analogous sum of squares for the “lower than” 
x-values: 
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» 

(5.15) 


(5.16) 


y. = Z S £ S (^;V ^kr ^i) ^likfpr 

i k p r 

s Z Z Z Z (^,p Ofijkjpr tiP\ 

j k p r 

2. = Z Z Z Z (■»;> “f" 3Ifcr W{) 5,/t/pf 

I k p r 

~ Z/ iZ Z^ (^-jp "1" I*r) ^ijklpr ~ Ut F, 

1 k p T 


Let F be the mean of all x-values, weighted by their respective frequencies 
in the entire experiment, and let W be the sum of squares of deviations from 
their mean of these z-values: 


(5.17) F ^ 23 Z^ 23 Z^ i^ip “h ^kr)cik/pr — ^ Z Z , 

kj I k p r b i r 

>F = Z Z Z Z (x,p + a;*. - F)^c,*/„ 

) k p r 

- £ E 2 2 - y‘c. 

i k p r 

W IS the total sum of squares for the experiment. Let R be the sum of Squares 
between individuals for the experiment, and let S be the sum of squares vnlhm 
individuals: 


(5.19) ii! = Z [(b - Vf + («, - F)V = F Z (4 + «?) - rc, 

i < 

(5.20) S =T.{y<^-Zi) = w - R. 

< 

Our principle for quantifying the judgments is to derive the i-values that will 
minimize the variation mlhin individuals compared with that within the group 
as a whole. This means making S as small as possible compared with IF. 
Therefore, if we define the correlation ratio E by 

(5.21) S' = 1 - S/W, 

our problem is to determine the Xjp that ivill maximize £?'. 

A convenient formula for is, from (5 20) and (5.21), 

(6.22) = R/W. 

Since S' is invariant with respect to translations of the a:-values, we can 
without loss of generality set 

(5.23) F = 0. 

Then we can write, from (5.19) and (5.18) respectively, 

(5.24) 72 = F Z (<J + M?) 

% 

TF = Z Z Z Z (®JP + Xkrf C,k/fr ■ 

i k P r 


(5.25) 
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6. The uniestricted maximum. To find the maximizing x-values for E®, 
we differentiate the right member of (5.22) with respect to the Xip and set the 
derivatives equal to zero This yields the stationary equations 


( 6 . 1 ) 


dR .dW 
— = JS*— . 


To evaluate the partial derivaiives of R, we differentiate the right member o f 
(5.24), using (5,13) and (5.14), and obtain 


(6.2) 


- — Ki (.fiipfikr “b QtjpQikr)- 

aXfp I' k r , 


Similarly for W, we differentiate the right member of (5.25) and obtain 


(6.3) 


dW 

dx,p ~ + 


S ^krCjkfpr) 
L r 


From (6.2) and (6.3), (6.1) can be written as 


(6.4) 


•I'lfcrhjA/jr — (.X^jpAjp "j” ^3 X^krC]k/pr)j 

k T k r 


where 


(6 5) 


h-iklpT — if»pf>kT "b Cop^lir). 


The numerical solution of the x-values is to be obtained from (0 4). 

Before showing a procedure for the numerical solution, let us verify that a 
solution of (6.4) will also satisfy (5.23). Summing both members of (6.4) ovoi 
j and p, and using (6.5) and relations among the notation laid down in the pre¬ 
vious section, we get 

23 53 ^krAkr = ^fX53 53 XipAip + 53 53 ^krAkr) 

k r IV hr 


or 

(6.6) (1 - B^)E E XkrAkr = 0. 

h r 

From (5,.17), this can be written as 

(6.7) (1 - B*) F = 0. 

Therefore, if B* 9 ^ 1, we must have F = 0. Hence, any solution of (6.4) which 
does not yield a perfect correlation ratio must have a weighted mean of zero for 
the x-values. Since a iierfect correlation ratio will not in general occur in 
practice, condition (5.23) will m general be satisfied and is no restriction. 

It should be noted that there is always a trivial solution for which B* is for¬ 
mally equal to unity. The trivial solution is to set a:,p = 1, Then s «,■ = 2; 
B = ■RT = 4C, E* = 1 , and (6.4) is satisfied since it reduces to (6.7). For this 
trivial solution, B is of course not an actual correlation ratio. 
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The non-trivial numerical solution of (6 4) can be carried out in practice with 
the aid of matrix algebra Instead of regarding the x,p as elements of a table 
with n rows with m, elements in the jth row, consider the rows of such a table 
placed end to end to form a single row of Af = wi, elements Denote this 

3 

as the row vector x. Correspondingly, consider the values h,h(pr arranged to 
form the elements of a symmetric matrix H oi M rows and columns; consider 
the M values A,p to be the diagonal elements of an M X M diagonal matrix A; 
and consider the values of Cjk/pr arranged to form unM X M symmetric matrix C. 
Let X = Then (6 4) becomes in matric form. 

(6.8) xH = X(xA + xC) = \xiA + C). 

In the next paragraph it is shown that, in general, (.4 + C) is non-smgular, 
so that it has an inverse by which the members of (6 8) can be postmultiplied, 
yielding 

(6.9) xH(A + C)~‘ = Xx. 

This shows that x is a latent vector of H(A + ^1)~\ and X is the latent root to 
which this vector corresponds. Since we want the largest possible correlation 
ratio, we seek the largest of the non-trivial latent roots. If the two largest non¬ 
trivial roots are not equal, which should ordinarily be the case in practice, then 
there will be a. unique latent vector associated with the largest root. 

It is of interest to show that all the latent roots of H(A -f- C)“^ are real and 
non-negative, and that all the latent vectors are real. First, we notice that H 
is Gramian, for its elements are product sums. To see that .4 C is Gramian, 
we notice that from (5.18) and (5,10), 

( 6 . 10 ) W = 2j^'E,x'',pA,p +2^. ^ ^ y ) y ) Xkr Ctkl-OT "V Cj 

IP 3 k p r 

or, in matric notation, and transposing members, 

(6.11) 2x(4 4- C)x' = W+ V^C. 

Since IF is a sum of squares, the right member is clearly non-negative; and hence 

(6.12) x(4 -t- C)x' 0, 

for all X. Thus, 4 -|- C is nonnegative-definite, or Gramian Furthermore, 
4 4- C is in general nonsingular, because according to (5.17) and (5.18), V and 
W cannot vanish simultaneously unless 

(6 13) (p^jp 4" jk/pr — 0 

If n ^ 3, then (6 13) will ordinarily imply that a:,, s 0, that is, the equality in 

(6.12) will hold if and only if x = 0. In such a case, 4 -f C is posj7i«e-definite, 
or is nonsingular as well as Gramian, and possesses an inverse. 

As is well known, the inverse of a Gramian matrix is Gramian (see [5, p 71], 
for example), so that (4 4- C)“‘ is Gramian. That the latent roots of H{A 4- 
C)~^ are all nonnegative follows from a general theorem that the latent roots of 
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the product of two Gramian matrices are always nonnegative [5, p. 110] The 
proof of this is brief, and will be repeated here in a little different variation in 
order to prove in addition that the latent vectors are all real. Let G be a sym¬ 
metric square root of 4 -f C, so that G* = C. If we postmultiply both 
members of (6.9) by G, we can write the results as: 

(6.14) (xG)(G"*/IG-‘) = \{xG). 

This shows that xG is a latent vector of G~^HG~^ corresponding to the root X. 
But is symmetric, and in fact Gramian, for it can be written in the form 

(G“'A)(G“^X)', where KK' = H. Hence, each X is nonnegative, and each 
xG is real, whence each x is real 

The numerical solution of (6.9) can be carried out by the simple iterative 
technique for latent roots and vectors (see, for example, [6]). The iterations 
converge in general to the vector associated Avith the largest root. To avoid 
convergence to the trivial solution (which formally has the largest root), the 
trial vectors should be adjusted to satisfy (5.23); then they ivill in general 
converge to the vector associated with the largest non-trivial root. 

A marginal indication of the internal consistency of the judgments is the 
agreement in sign of 

(^Ip "(■ Xir) (Xjq -f- Xjij) 

with 

S,)k/pr,<ia ®0k/(is,)ir 9 

‘ i 

for each of the comparisons. If one combination ia judged higher by more 
people in comparison with another, then its x-values should exceed those of the 
other for marginal consistency. 


7. The maximum under certain linear restrictions. In the previous section, 
no restrictions were placed on the a;,„ in maximizmg jE' For some problems,’ 
the may be quantitative, and it may be desired within each item to keep the 
distances between the S/p proportionate to the distances between the 0,p . This 
was the case for the score card, where a linear system of weighting had to be 
used to be practicable for the army. It was necessary to derive a constant 
multiplier for length of service, a constant multipler for time overseas, etc,, 
even though there rvere curvilinearities in the judgments. 

Our principle enables us to handle such restrictions just as well as the un¬ 
restricted case. We shall derive the set of multipliers which is most consistent 
or the judgments in the sen^ of least squares. The ordering of categories 
Withm an item will no longer be considered arbitrary. Instead, subscripts will 
^ assigned m a fashion to make (0„ - 0„) proportional to tp - q) vHthin 
each For convenience, the subscripts can be assigned beginning from zero 
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The linear restriction is to determine a:-values in the form 


(7.1) 

+ vnt , 

where the f, and the i?, are now the basic unknoivuis to be solved for to maximize 
It is the Tjj that are of interest, for they will be the multipliers; but the.^j 
have to be used in the analysis to help determine the multipliers even though 
they are only additive constants that wiU not affect the order of total scores of 
people. 

To maximize under the linear restrictions, we differentiate the right mem¬ 
ber of (5.22) with respect to the and the ij,, set the derivatives equal to zero, 
and obtain the stationary equations 

(7.2) 

SR 

(7 3) 

dfi ^ ^2dW 

Sri, St), 

In order to evaluate the indicated derivatives, it is helpful to introduce some 
more notations. Let: 

(7.4) 

lo,\h — ftkr f V2o, ik — 2!!) fftkr 

r r 

(7.5) 

ll,%k — Tftkr t ^9*kr 

r r 

(7.6) 

da,]k ^ P Ojk/pr 

V r 

(7.7) 

(^11,jk ” P'^^jkfpr — 

p r 

(7 8) 

■^a,; “ 2^ V = 23 

k p r K 

(7 9) 

^ -^23 Go,<7^0,.A; + W0,i7^D,,'fc) 

(7.10) 

r % 

(7.11) 

1 

h2,\K “ "ri 23 TTli.ifc). 

r 4 


It is important to notice that do,,k = do,*,, but that d,.,* di,*,, Similarly, 

Ao.,* — ho,ki and h 2 ,,k = Aa.ty, but Ai.,* ^ Ai,*,. 

To evaluate the derivatives of R, it is helpful to re-write the right members of 
(5.13) and (5 14) by means of (7.1), (7.4), and (7.5): 

(7-12) ^ (5*^.»t + Vkll,xk) 

r k 

2 V 

w.’= pS (fi’Wo,,* d" ’nkfni.tk) 
r k 


(7.13) 
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Differentiating the right member of (5.24) rvith respect to the and the t], re¬ 
spectively with the aid of (7.12) and (7.13), and using (7.9), (7.10), and (7.11), 
yields 

dR 

(7.14) ^ = 822 i^hKik + Vh hi,k,) 

of, t 

dR 

(7.15) T" - 8^ (^kh.ik + Vkhi.tk)- 

For the derivatives of W, we re-write (5.25) using (7.1): 

(7.16) F = 2 2 Z) £ (^J + P»/J + • 

I k p r 

Dffferentiating with respect to the f, and ti, respectively, we obtain, using (7.6), 
(7.7), and (7.8), 

of, k 

(7.18) — = 4[f,- Di.) -b V] Di,, + ih di.)k + Vk dii,,i)]. 

07 ), h 

The stationary equations (7.2) and (7.3) can now be rc-written by means of 
(7.14), (7,15), (7.17), and (7,18) as: 

(7.19) 22 (?* h.jk + Vk hi,A.,) = Do,, + + 22 (t* 4- 7)* rfl.*,)] 

k k 

(7.20) 22 {hh.ik + r)k]ii,jk) = + 7); Dz.y -|- 22 i^kdujk 4- 7)*, dn,i*)]. 

k h 

These are the equations to be solved numerically for the maximizing f, and r/y. 

Before showmg a procedure for the numeiical solution, let us verify that a 
solution of (7.19) and (7.20) will satisfy (5.23). From (7.1), (5.17), and (7.8), 

(7.21) F=^E(f*Do.A,4-»)iA.*). 

Summing both members of (7.19) over y shows that 

(1 ~ S (^kDo.k 4“ VkDi^k) ~ 0, 

k 

Or, from (7.21), 

(1 - E^)V = 0. 

Hence, iiE^ 1, the corresponding solution will satisfy the condition that F = 0. 

As in the unrestricted case, there is always a trivial solution that will yield an 
F* formally equal to unity. This trivial solution is f, = 1, ?)/ s 0, iVhich makes 
x,p s 1 as in the previous case. These values satisfy (7.19) and (7.20), and 
have F = 1. Of course, F is again not an actual correlation ratio for this trivial 
solution. 
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To obtain a non-trivial solution, it is convenient to write (7.19) and (7.20) in 
matric notation. Let 


(7.22) 


z = II [fi] h/] 11- 


2 is a row vector of 2n elements, the first n elements being the f / and the last n 
elements being the »?, Let 


(7.23) 



[^1,1 fc] 

[Kk,] 

[ht.ik] 


h is 2n X 2n and is symmetric, in fact it is also Gramian, since its elements are 
product sums. Let h,h be Kronecker’s delta, and let 


(7.24) 


c = 


[L>o,3 ^jk + do.,fc] [Di.j 5,*; + di.jjt] 
[•Dl.j fiji + dl.ij + dll,,ifc] 


c also is 2n X 27i, symmetric, and Gramian. Agam let 

(7.25) X = 


Equations (7.19) and (7.20) can now be stated as a single matric equation: 

(7.26) zh = Xzc. 


In general, c will be nonaingular, so that it will have an inverse by which both 
members of (7.26) can be postmultiplied to yield 

(7.27) zhc~^ = Xz. 

Therefore z is a latent vector of hc~\ and X is a latent root. Since we want the 
largest correlation ratio, we seek the largest of the non-trivial latent roots 
The largest root in practice will ordmarily be unique. There is then a unique 
latent vector correspondmg to this root, and the elements of this vector provide 
the most consistent f, and rj, for the population in the sense of least squares. 

That c is Gramian and in general nonsingular, that the latent roots of hc~^ 
are all nonnegative, and that the latent vectors of are all real, requires only 
proofs analogous to those for the corresponding properties of A -f C and h(A + 
C)~^ in the previous section, which need not be repeated here. 

As in the previous section, the final numerical steps can bo carried out by 
iterations according to (7.27). Again, the trial vectors should be adjusted to 
conform to (5.23) to prevent convergence to the trivial solution. 

A marginal mdication of the consistency of the quantification is the agreement 
in sign of 

(P - Qh, + (r - s)vk 

with 

^iiklqs.pr , 


for all comparisons. 
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Appendix; A distinction between the conventional principle and the present 
principle. The relationship between the conventional principle of estimating 
means of hypothetical distributions and the present principle of reproducing 
the comparisons of each individual will be analyzed here for the case of ordi¬ 
nary comparisons. Only the 'prinevples wdl be contrasted here. 

In the conventional approach, it is assumed that each of the JV individuals 
has a numerical value for each of the 0,. Let Si, be such a value of 0, for the 
ith individual. The hypothesis is that person i makes the judgment 0, > Or if 
Sm > s,h ; and the conventional problem is to estimate from the judgments what 
the relative distances are between the means iij, where 

(A.l) Mj = ^ • 

The ranges of the subscripts are: f = 1, 2, ■ • • , iV; i, fc, 2 = 1,2, • • ■ , 71 ; and will 
not be explicitly mdicated. 

According to the approach of this paper, if we are io consider hypothetical 
variables, the pioblem would be to determine for each 0/ a numerical value X/ 
such that the differences {x/ ~ Xk) will best approximate the (s</ — s,’*) for each 
individual in the sense of least squares. This will separate “higher than” x~ 
values from “lower than” a:-values. If we let 

(A.2) Z = £ L S [(Sr,- - a.*) - Wi ix, - a;*)]’, 

t y k 

whete w, is a constant of proportionality to be determined for each individual 
separately, then the problem is to determine the Xj and the which will mini¬ 
mize Z. 

Differentiating Z with respect to the Wi and x/ respectively, and setting the 
derivatives equal to zero, yields the stationary equations 

(A.3) — S,) Wi (a;, — 2)] = 0 

t 

(AA) 53 (a;* - S)(srt — WiXk) - 0, 

k 

where 

(A'5) s. = ^ 53 , » = - 53 . 

U k fl If 

Smee Z is invariant with respect to translations of the xi (also to trandations 
of the stj), the origin of the x, is arbitrary, and there is no loss in generality in 
setting 

(A.6) X = 0. 

Then if we let 

* ^ 
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equations (A.3) and (A.4) can be re-written respectively as 


(A.8) 

E Wt (St, - S.) = 

% 

(A.9) 

E Xh Stk = 


h 


By summing both members of (A.8) over j, we see that 
(A. 10) a;, = 0. 

J 

Therefore, since in general a > 0, we must have 2 = 0; and a solution of (A.8) 
will necessarily be consistent with (A.6). 

Usmg (A.9) in (A.8) yields the stationary equations for the Xj alone: 

(A.ll) Xfc y !! Sifc(S|/ “Si) tX^Xj . 

k \ 

This shows that the x j are elements of a latent vector corresponding to a latent 
root 0/3 of the n X n matrix defined by the elements S,k , where 

(A.12) Sjit = Sifc(S(, “ s,) = StjStk “ — 22 • 

t t Tif I \ 

To determine which one of the latent roots provides the minimum Z, we first 
notice—by multiplying both members of (A.9) by w,, summing over r, and using 
(A.7)—^that 

(A. 13) 52 22 a:* Sik Wt = 0/3. 

1 h 

Then expanding the right member of (A.2) with the aid of (A.9) and (A.13), we 
obtain 

(A.14) Z/2n = E E(s., - 5,)' - ali. 

» J 

Clearly, Z will be minimized if we use the largest a/3. Therefore, we seek the 
latent vector associated with the largest latent root of |1 S,k [1. 

To examine the relation of the elements of this minimizing latent vector to the 
means tij of the hypothetical variables, denote the variances and correlations 
of the hypothetical variables by: 


(A.15) 

(A.16) 


^ E (s.j - ttif = ^ E Su - M? 


E (Sit — Mj)(s.ft ~ Hk) 

i 

ATff/ tfk 


E ^11 Si* My M* 


ff, Cfk 


Then 

(A.17) 


Plk = 


E SiiSa A(<r, OA P)A + M, Ph) . 
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From (A.17) and the last member of (A.12), we can write 

(A.18) S,k = o-j (Ti p,t, + /:<,• Mfc - ^ S (o'* Pki + Pk Pi) ■ 

A I 

The elements of the matrix of which the Xj are a latent vector are now ex¬ 
pressed in terms of the means, variances, and correlations of the hypothetical 
variables, according to the right member of (A.18). It is clear that in general, 
the n, are not elements of a latent vector of jj »S,i H, so that our approach is in 
general not equivalent to the conventional approach. 

In the special case of equal variances and correlations, such as is often as¬ 
sumed in the conventional approach,* we can now see that the n, do define a 
latent vector. For tliis case, let the common variance be o-*, and let the common 
correlation coefficient be p. Then 

(A,19) Pifc = p + ^itCl ~ p)i 

where Sja is Kronecker’s delta, and (A.18) becomes 

(A.20) i - P)(s,k - + (m, - P)pk, 

where 

(A.21) 

From (A.20) and (A.12), (A.ll) becomes converted to 
(A.22) [7 — (r“(l — p)] X, = (p, — il)J2phXk, 

k 

where 

(A.23) 7 = a^/N . 

Multiplying both members of (A.22) by Xi and summing over j shows that 

(A 24) (2^ fi, x,f = ^[7 — a {I — p)] . 

1 

From (A,22) and (A.24) we obtain the elements of the minimizing latent vector 
for 2 to be, m normalized form, 

(A.25) -£l = f^>' - P 

Vp Vy - <rHl - p) ■ 

That this is the minimizing vector follows from the fact that the remaining 
latent roots must all have 7 = cr (1 — p) in order to have vectors distinct from 
(A.25); (A.25) does correspond to the largest nontrivial root, since for it the 

* More specifically, zero correlations are assumed, but this is not necessary for our 
puTpose. 
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root satisfies the inequality 7 > ^ (1 - p). (The remaining latent vectors are 
not uniquely defined, for they all correspond to equal roots,) Therefore, the 
means of the hypothetical vaiiables are a linear function of the elements of the 
minimizing latent vector for the case of equal variances and correlations 
As a final comment, it‘should be pointed out that paired comparisons are 
insufficient to estimate the hypothetical values. Two persons with widely 
different hypothetical values will make the same judgments provided only that 
their values have the same rank order Therefore, hypotheses about variables 
presumed to underlie the comparisons cannot be completely tested only on the 
basis of the comparisons 

Psychologically, it may or may not be proper to assume that judgments of the 
type 0, > Ok can be expressed as a function of differences s„ - s,*. Perhaps, 
psychologically, comparisons may operate on some more complicated principle. 
The approach presented in the body of this paper does not assume anythmg 
about underlying variables, but simply seeks a set of numerical values that will 
best help reproduce the observed data for each individual. 
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RELATIVE ACCURACY OF SYSTEMATIC AND STRATIFIED RANDOM 
SAMPLES FOR A CERTAIN CLASS OF POPULATIONS^ 

Bt W. G. Cochran 
Iowa State College 

1. Summary, A type of population frequently encoufitered in extensive 
Hdin plin gR IS one b which the variance within a group of elements increases 
steadily as the size of the group increases. This class of populations may be 
represented by a model in which the elements are serially correlated, the correla¬ 
tion between two elements being a positive and monotone decreasing function 
of the distance apart of the elements. For populations of this type, the relative 
efficiencies are compared for a systematic sample of every kth. element, a stratified 
random sample with one element per stratum and a random sample. 

The stratified random sample is always at least as accurate on the average 
as the random sample and its relative efficiency is a monotone increasing function 
of the size of the sample No general result is valid for the relative efficiency of 
the systematic sample. In fact, there are populations in the class in which the 
systematic sample is more accurate than the stratified sample for one sampling 
rate, but is less accurate than the random sample for another sampling rate. 
If, however, the correlogram is in addition concave upwards, the systematic 
sample is on the average more accurate than the stratified sample for any size 
of sample 

Some numerical results are given for the cases in which the correlogram is (i) 
linear (ii) exponential 

2. Introduction, We consider a finite population consisting of the elements 

xijXi, , where n and k are integers. A systematic sample is drawn by 

choosing an element at random from the elements x;, • • ■ , xi,, and then selecting 
every kth consecutive element. That is, if x,- is the element first chosen, tlie 
systematic sample comprises the elements x<, xi+t, ■ • • , x< 4 .(„_i)k. This type 
of sample has found considerable use in practice, because it is often easier to 
select and to administer than a random or stratified random sample and because 
it has an intuitive appeal through spreading the sample evenly over the popula¬ 
tion Much remains to be learned, however, about the accuracy of this system¬ 
atic sample relative to that of comparable random or restricted random samples. 
Probably the most relevant comparison is that between the systematic sample 
and the stratified random sample having one element per stratum. In the latter 
case, the population is divided into the n strata {xi, • ■ • , x*], {xt+v, •' • , 
X 21 ;), • • , and one element is chosen independently at random from each of the 
strata. This type of sample is similar in many respects to the systematic 

* Journal paper No. J-1341 of the Iowa Agricultural Experiment Station, Ameg, Iowa. 
Project 891. , p, w 
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sample. Both divide the population into the same n strata of k elements each, 
with one element chosen from each stratum. Moreover, neither sample provides 
the data for an unbiased estimate of the sampling variance of the sample mean, 
at least in the sense that the estimate is unbiased whatever the form of the 
population of elements x,. 

The first thorough investigation of the properties of systematic samples was 
made by W. G. and L. H Madow [1]. In particular, these authors compared 
the accuracies of a systematic sample and a stratified random sample of the types 
described above for several types of finite population. Where the elements in 
the population lie on the line x, = i, they showed that the stratified random 
sample, with one element per stratum, is more accurate than the systematic 
sample. If the population has a periodic distribution, the stratified random 
sample is superior when fc is an integral multiple of the period, but the system¬ 
atic sample is superior when k is an odd multiple of the half-period. The authors 
also considered the more complex case where the population contains both a trend 
function and a periodic function. 

The object of this paper is to make similar comparisons for another type of 
population which appears to be fairly frequently encountered in extensive 
samplings The population is one in which the variance among the elements in 
any group of contig:uous elements increases steadily as the size of the group 
increases. This type of population has long been regarded as applicable in field 
experimental work, where the variance among plots within a block is found 
usually to increase with the size of block. Summarizing data from 40 uniformity 
trials, Fairfield Smith [2] verified this notion and derived an empirical relation¬ 
ship from which the rate of increase may be estimated. The same type of popu¬ 
lation is also considered in several recent papers on extensive sample surveys. 
Thus, in a discussion of methods for sampling farm populations, lessen [3] 
postulated a law in which the variance among farms within a grid is a monotone 
increasing function of the size of the grid and used the law for estimating the 
optimum number of farms which should be included in a sampling-unit. 
Mahalanobis [4] independently developed the same law as Fairfield Smith in a 
comprehensive investigation of large-scale sample surveys. Hansen and Hurwitz 
[5] referred to the increase m variance within a cluster with growing size of cluster 
as typical of many actual populations Numerous other references could be 
given. 

3. Specification of the population. Various mathematical models may be 
constructed to represent the situation in which the variance within any group 
increases with increasing size of group. For instance, Ave might consider that 
the elements x, are drawm from different populations, the population changing in 
some regular manner Avith i. Alternatively, the Xi may be assumed to belong 
to the same population, but to be serially correlated. For simplicity, Ave assume 
further that the serial correlation between Xi and a.+u is some quantity pu which 
depends only on u. Then if pu is positive and is a monotone decreasing function 
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of u, it may be expected from intuitioa (and will be proved later) that the 
variance mthm the group of elements a;,, i.+i, • ■ ■ , aJi+fc is a monotone increasing 
function of k This model seems appropriate for our purpose, since many writers 
refer explicitly to positive correlations between the x’s as the basis for the 
phenomenon of increasmg variance. 

The specification above ivill be qualified m one respect. To assume that the 
p’s are stricUy monotone for an actual finite population of only moderate size 
does not seem realistic While the correlogram may exliibit a definite downward 
trend, yet individual fluctuations about the trend prevent the correlogram from 
being strictly monotone. It is more reasonable to regard the finite population 
as being itself a sample from an infinite population in which the p’s are monotone. 
This attitude is, I believe, in accord with that of the authors referred to above, 
who, as I interpiet their writings, regard the variance law as holding in an ideal¬ 
ized population Thus, comparisons between the systematic and stratified ran¬ 
dom samples will be made not for a single finite population, but for the average of 
finite populations dravra from an infinite population with monotone decreasing p. 
Results for an individual finite population will differ from the average results 
because the r’s which appeal in the population fluctuate about their expectations 
p. As the finite population becomes larger, its results will tend to coincide with 
the average results. 

Accordmgly, the elements x,, i = 1, 2, ■ ■ ■ ,nk, are assumed to be drawn from 
a population in which 

= p, E{xi - iif = c, E{x, - p)(a:<+u - p) = pucr® 
where Pu > p« > 0, Avhenever u < v. 


4. Some useful preliminary formulas. If x is the mean of a specified finite 
population, the following algebraic identity, frequently useful in the analysis of 
variance, is easily established. 

(1) (kn) 2 (Xi - = 12 

i-i 1—1 j>« 

Smee there are {kn){,kn — l)/2 possible pairs of values (x,, Xy), this gives 

(2) Z (X. - X)^ = E(X, - x,f = ~ 


where E is taken over the finite population. Now expand the quadratic and 
average over all finite populations. In the {kn){kn — l)/2 combinations, there 
are {kn — 1) in which j exceeds f by 1, {kn — 2) in which j exceeds i by 2, and 
so on. Hence 


(3) 


k n 

Z (^1 ~ = {kn — 1) <r“ 



2 

{Jen) {kn — 1) 


/;n—1 

Z {kn — 


U»1 



To obtam the corresponding expectation for the sum of squares within a single 
stratum of k consecutive elements, we need only replace {kn) by A in (3). Since 
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the result is the same for all n strata, we obtain 

i o''-! ■) 

(4) E (S S. within strata) = ii(k — l)ff^ jl — ^ ^ ^ 2 0^ ~ m)pu|- 

Formula (3) also gives the expected sum of squares within a specified system¬ 
atic sample if we replace (kn) by n and u by (ku), since there are n elements in 
the sample and since the correlations between successive elements are pk , pn, ■ ■ • 
instead of pi, /) 2 , ■ • • The result is the same for each of the k systematic 
samples. Hence 

( 2 

(5) E {8. S. within systematic samples) = A (n — 1) o- ^1 — ^ 

n-l -j 

' (R W) Pku r t 


6. Average variance for a random sample. The symbols al, ah , will be 
used to denote the average variances of the means of the random, stratified ran¬ 
dom and systematic samples, respectively, about the mean of the finite popula¬ 
tion, this average being taken over all finite populations drawn from the mfinite 
population specified in the previous section Comparisons with the random 
sample, though not our mam purpose, will be mcluded where they are of interest, 
For a single finite population, it has been shown by several writers that the 
variance of the mean of a random sample is 


( 6 ) 


1 _ {kn - n) . _1_ y (ar _ fV 
n {kn-\) knt^x^' ^ 


where x is the mean of the finite population. 
From (3), we obtain 


(7) 



{kn) {kn 


k Ti—1 

S Ocn - u) P„|, 


6. Average variance for a stratified random sample. If x,t is the mean of a 
typical stratified random sample, the sampling variance of x,i is by definition 

( 8 ) E{x.t - xY. 

Consider first the average over a single finite population. Let xi, xi, - Xn 
be the means of the n strata, respectively, and let Xi,, x^, ■ ■ ■ , i„,- be the ele¬ 
ments selected from the respective strata Then (8) may be written 

(9) “2 f (^ij ~ “b ~ ^ 2 ) + ■ ■ ■ + {xni — ^n) }* 

n n 

2 Xtj = nf.i and £ = nx. 


since 
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Take the average over all fc" samples from the finite population. All cross- 
product terms vanish, since, for example, appears equally often with xu , X 21 , 
,Xik. This gives 

s S § 

for the variance for a smgle finite population. The sum of squares involved is, 
of course, simply the sum of squares within strata. Hence, by (4) 



7. Average variance for the systematic sample. If is the mean of a typical 
sample, the variance for a single finite population is 

(12) E (x.y - xf = ^ (nS (x., - 5)“) 

where the sum is taken oi'er the k systematic samples. Smce the sum of squares 
among samples is equal to the total sum of squares in the population minus the 
sum of squares within samples, (12) equals 

1 fcn 2 ■ 

(13) 1 — £ ( 2^1 “ •S)'^ ~ r- (S- S. within systematic samples). 

ku >-1 ku 


To obtain the average over all finite populations we substitute from (3) and 
(5) for the first and second terms respectively. The result is 


(14) = 


2 _ {kn — 1) 0 J 


kn 


This reduces to 


fcn—1 


r 


ikn) {kn - 1) S 


(n — 1) 2 

- (T 


1 - 


n{n 


S {n - u)piu|. 


(15) 



2 

kn{k — 1) 


cn—1 

z 


u-1 


{kn — u) 


It should be noted that the formulas and notations above are different from 
those used by the Madows, who define p and cr* with reference to a single finite 
population and discuss the sample variances for a single finite population. 


8. Relative accuracies of random and stratified random samples. First, some 
general comments From (7), (11) and (15) the relative efficiencies of the three 
types of sample are seen to depend only on the linear functions of the p’s which 
appear in tr,, Xst, and (r,„. It is easy to verify that in each case the sum of the 
coefficients of the p’s is unity. For the random sample, the linear function in- 
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volves every serial correlation up to lag (kn — 1) with, coefficients which decrease 
linearly as the lag increases and are independent of the size of sample, depending 
only on iV = (kn), the number of elements in the finite population. For the 
stratified random sample, only serial correlations with lags up to {k — 1) appear, 
k bemg the number of elements in the stratum. As presented in (16), the 
formula for the systematic sample is separated into two linear functions. The 
first is the same function as appears in the formula for the random sample except 
that all coefficients are {kn — l)/(k — 1) times as large The second, which 
carries a positive sign, involves correlations where the lag is a multiple of k. 

Thus far the formulae require no restrictions on the p’s In considering the 
case where the p’s are positive and monotone decreasing, the following lemma is 
helpful. 

Lemma. 7/ p,, (i = 1, • • ■ , m), are positwe and monotone decreasing, that is, 
p. > Pi+i > 0 ciTid f/ (ai + ci 2 -jr • • ■ + am) is zero, the necessary and sufficient 
conditions that 

(16) L = aipi + aaPa + ■ ’ • + amPm > 0, /or oil admissible sels of p‘s, 

(17) are ai + aa + • • • + a. > 0, f = 1, 2, • • • , (m — 1). 

For let p, = p,Hi + , where by hypothesis 5, > 0. Then if we substitute 

successively for pi , pa , • • • , pm_i in terras of 5i , , • ’ • , 5m-i, we find 

(18) L = ai5i + (ai + “ 2)^2 + (ai + a2 + 03 ) 8 ^ + * • • 

+ (ai + «2 + • • • "h am_i)5in_i, 

the final term in pm vanishing because (ai + • • • + am) is zero. Since all 5, > 0, 
the sufficiency of (17) is obvious. Also, if for any i the coefficient of 5, is negative, 
we can make L negative by choosing that S< as positive and all other 5’a as zero. 
This establishes necessity. 

Corollary. If p, are strongly monotone, i.e., p, > p,+i, and if at least one of 
the a, is different from zero, conditions (17) are sufficient to establish that L exceeds 
zero. For in (18) all the 5’s are greater than zero and by (17) none of the S’s has 
a negative coefficient. Further, the coefficient of at least one of the 5's must 
exceed zero, otherwise all the a’s would be zei’o. Hence L > 0. 

We now show that if the pu are monotone decreasing, 

(19) L{k) = S (fc - «)Pu 

is a monotone decreasing function of k. This is the Imear function which appears 
in the variance of the stratified sample. 

(20) L{k) - L{k + 1) = 5, ~ {k + l)/c S ^ ^ ~ 

2 * 

A(fc“ - 1) S ^ ~ ■ 


(21) 
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Since the sums of the coefficients of the p„ are unity in L{k) and L{k + 1), 
the sum is zero in (21). Hence the lemma may be applied. But it is obvious 
that the sum of the first i coefficients in (21) exceeds zero, since the coefficients 
are all positive for u < (/c + l)/2 and all negative for u > {k + l)/2. Hence 

(22) L{k) - Lik + 1) > 0. 


Further, by the corollary, if the Pu are strongly monotone, L(fc) is strongly mono¬ 
tone Since all p„ are positive, this result is sufficient to prove that 




2 nft—1 

ink){nk - 1) S ~ ■ 


Consequently, for any size of sample the average variance of the stratified sample 
cannot exceed that of the random sample. Further, the relative efficiency of the 
stratified sample to the random sample is monotone increasing with decreasing 
size of stratum, i e. with increasing size of sample. There is, of course, nothing 
unexpected in these results. Equation (22) also establishes the result mentioned 
in the thiid section, that with monotone decreasing p, the average variance with- 
m strata increases steadily as the size of stratum increases. For if nik — 1) de¬ 
grees of freedom are assigned to the sum of squares within strata, formula (4) 
above shows that the average variance within strata is 

“ k{k - ij S 1 1 ■ 

9, Comparison of the systematic and random samples. Upon investigation, 
it is soon evident that no general results can be established about the efficiency 
of the systematic sample relative to the random samples, unless further restric¬ 
tions are made on the form of the population. In order to apply the lemma, we 
find the sums of the first i coefficients of the linear functions of p which appear 
m the variance formulae (7), (11) and (16) By elementary methods these sums 
are found to be 


(25) 


2:.= 


i{2nk — f — 1) 
nk(Tik — 1 ) 


r- 


t(2A: - i — 1) 
k{k - 1) ’ 


1 


1 < f < (fc - 1) 

i > k. 


T... = 1 - 1) _ rfc(2n - r - 1) 

nk{k - 1) n{k - 1) ’ 

where r is the integer such that (r + l)ifc > i> rk. 

™ establish o-J, < o-J,, it would be necessary to show 

that for any t. Now if i is less than k, so that r is zero, clearly 
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(26) Z.V > E.t > Er, i=l,2,---,Qc- !)• 

except when n is 1, in which case all three are equal. 

But if i IS an integral multiple of k, say rk, we find 

w 2..-I, 2..=^:, 

BO that 

(28) E.< >'Zr> . 

Consequently the conditions of the lemma are not satisfied with regard to the 
systematic sample and no general theorem exists for all populations with mono¬ 
tone decreasing p. The result (26) and the corollary show that for any popula¬ 
tion in this class which has pu = 0, u > {k — 1), the systematic sample is more 
efficient than the stratified random sample. On the other hand, (28) shows that 
m a population with the first k of the p’s equal and the rest zero, the systematic 
sample has a higher variance than a random sample. If these two results are 
collated for a population with the first j of the p’s equal and the rest zero, we see 
that the systematic sample with stratum size j is less accurate than the compar¬ 
able random sample, while the systematic sample with stratum size (j + 1) is 
more accurate than the comparable stratified random sample. Although such 
a population may not occur in practice, the result suggests that the graph of the 
variance of the mean against the size of sample is unlikely to exhibit the same 
regularity for the systematic as for the random samples. 

10. Populations in which the correlogram is concave upwards. Further 
investigation shows that the deciding factors in determining the relative accura¬ 
cies of the systematic and random samples are the second differences of the pu 
rather than the first differences. The foUowmg result will be proved. 

Theorem : For all infinite populahons in which 

Pi > Pi+i > 0, i = 1, 2, • - • , (fcn — 1), 
and 

= Pi -1 -h p«+i — 2pj > 0, i = 2, 3, • • • , (fcn — 2), 

then 

2 ^ 2 2 
^»v ^ ^9i ^ 

for any size of sample Further, <rly < , unless S? = 0, i = 2, 3, • • • , (A:n — 2). 

This result can be proved by expressmg the Imear functions of the pu in terms 
of second differences and establishing a new lemma applicable to second differ¬ 
ences. An alternative approach is simpler and perhaps more instructive. 

Since the pu are monotone decreasing, o-Jj < (rj by the results in section 8. In 
(13) above, the variance of the mean of a systematic sample for a specified finite 
population was expressed as 
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1 1 

— 2 (^> ~ “ u; (Total S.S. within systematic samples) 

h/tn f^fl 


(29) 


kn ^ 


1 *" 1 

= _ V (x. — -(Average S.S. within a systematic sample). 

knti 

k corresponding equation holds for stratified random samples. For if xiji 
Xs,, ■ ■■ , x„j are the elements of any stratified random sample with mean S., 


(30) 


2 (xtj — ^ (x«y — f»i)* + n(£j| — J)’. 


•-1 


Now take the average over all A;* samples. This gives 


1 

(31) - 22 {x, — = (Average S.S. within samples) + nE{S,t — x)“. 

k i>i 


Since the term on the extreme right is n times the variance of the stratified 
random sample, a result analogous to (29) follows at once. 

Consequently, <rjy < o-Jt if the average sum of squares mthin a systematic 
sample is greater than or equal to that within a stratified random sample. Now 
by (2), with n in place of (kn), each of these averages is equal to 


(32) 


(IL^) E(X,J - X,y)* 


where xy, are the elements in the sample from the tth and the fth strata 
respectively, the average being taken over all possible pairs of strata. 

We consider a fixed pair of strata and let 1 — t = u. For the systematic 
sample, corresponding elements in the ith and 1th strata are always (fcu) elements 
apart. Hence, 

(33) E,y (xij Xiy) = 2<r (1 — ptw). 

For the stratified random sample, there are fc* possible pairs of elements from 
the two strata. One pair is (ku — fc + 1) elements apart, two pairs are 
(fcu — fc + 2) elements apart, and so on, the numbers of pairs rising linearly to 
fc and then decreasing linearly to one for the final pair which are (fcu + fc — 1) 
elements apart. This gives 

( 34 ) F.,(x„ - x,jf = 2 <r* |l - i (fc - 111 )p*„ . 

Hence, to complete the proof that < aji, it is sufficient to show that 

C*-i) 

(3®) 52 (^ “ 1 f 1 )p»iH-< ~ k* ptu > 0 

(k-1) 

for u = 1, 2, • • • , (n — 1), that is, for any pair of strata. This may be written 

(*-i> 

(^®) 52 (^ ~ i)(p*«i+« + piu-t — 2ptu) > 0. 

i»i 
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But if 5fcii — pfcu^i "H pfeu-f-i 
show that 


2pi,u is the second central difference it is easj’' to 


(i-i) 

(37) phuH + Pku-i — ^Pku =2 (i ~ ^ Oj 

since by hypothesis 5^ > 0, j = 2, 3, • • • , (fcn — 2). This proves that the 
variance between the elements of the systematic sample is greater than or equal 
to that between the elements of the stratified random sample for any fixed pair 
of strata. The result for the overall average follows. Hence < o-Jj. 
Further, unless a-] = 0, for all clearly <rjy < cr^, except for samples of one. 

The essential point in the proof may be put as follows. The elements in the 
ith and Zth strata are on the average (ku) elements apart for both the systematic 
and the stratified random sample. When two elements in the latter sample are 
(ku + *) elements apart, they are less correlated than on the average, since 
piu+i < piu 1 and thus provide more independent information. The vari¬ 
ance between the elements exceeds the systematic sample variance by 
2a*(pj,u — Piu+i). However, such cases are counterbalanced by an equal num¬ 
ber of cases in which the elements differ by (ku — i) and the variance is below 
the systematic sample variance by 2o-’'(pji,„_, — p^u). Because of the concavity 
of pu , the losses on the average balance or outweigh the gains 

For the population discussed in section 9, in which pu = p, m = 1, 2, • • • , 

Pu = 0, u > we have < 0, > 0, and 6u = 0 otherwise. This reversal 

of the sign of the second difference is the explanation for the anomalous behavior 
of the systematic samples with stratum sizes j and (j 1). 

The theorem above does not prove that the relative accuracy of the systematic 
to the stratified random sample is a monotone function of n, nor even that dy 
decreases steadily as n increases. Actually, there are populations in the class for 
which neither result holds, as will be illustrated in the next section. 

So far as practical applications are concerned, the restriction that the pu should 
be concave upwards may not be severe. For instance, this condition is satisfied 
ivhen the correlogram is linear, i e. p« = (I — u)/l, this being one type of correlo- 
gram which Wold [6] has considered applicable to economic data. Concavity 
also holds for the function pu = e which Osborne [7] has suggested for forestry 
and land-use surveys and for the relation p„ = tanh which Fisher and 

Mackenzie [8] used for expressing the correlation between the weekly rain at 
two weather stations as a function of their distance apart. In fact, if pu is 
conceived of as positive and continuous for all u, a concave upwards function 
suggests itself naturally. 


11. Linear correlograms. it may be of interest to present some results ob¬ 
tained when the correlogram is (i) linear, (ii) exponential, since both types have 
been suggested as possible models for populations occurring in practice. 

In the linear case, 
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(38) P^, = {L - u)/L, u < L] P„ = 0, « > L. 

If L > {nk - 1), the correlogram is a straight line throughout the whole range 
of the finite population. Since all second differences are zero in this case, we may 
expect = o-Ji < crj. If L < (nk — 1), all second differences vanish except 
6i , which is positive. Hence wp may expect cr\^ < trjt < ffj . 

The results for these cases are found by elementary summations from the basic 
formulae (7), (11) and (15). Detaila of the summations will not be presented. 
For i ^ (ni — 1), we find 


(39) 


^ ^1 ^ 


(k+1) 


2 

<rr 


= - (l - ^ 
n \ K 


\(nk + 1) 


3L ’ n V* k/ 3L ' 

The ratio a\le\ is (nfc + l)/(fc + 1), which is approximately equal to n, the size 
of sample, unless the percentage sampled is large. Thus very large gains in 
eflficiency over random sampling are obtained. 

If i < (nfc — 1), the formulae are less simple. Consider first k > L] that is, 
cases where the percentage sampled is less than 100/i. Jf N — nk, 

l\j3N{N-L)+iL^-l)] 
kj \ 3N{N - 1) 

l^|^(i:_-L) + (L^ ~ 1)1 


(40) 


(41) 


2 

Cr 


2 

(Tit 


(42) 


2 

<^IV 


TV \ 

■-( 

ft \ 

--('-E 

ft \ k, 


kJ ( 3k{k - 1) 
l\ i3N{k - L) + 0 - 1) 


k>L 


)\ 


3Nih ~ 1) 


k>L, 


It is clear on inspection that < <t*< j moreover, it is easy to show that the 
eflficiency of systematic relative to stratified random sampling increases steadily 
as the size of sample increases. 

When the size of sample is increased further so that k < L, formula (40) 
remains unchanged, while <r]i is now given by the same formula as in (39). The 
formula for is more complex. If q is the integral part of the quotient when L 
is divided by k and r is the remainder, so that L = {qk + r), the formula may be 
written 


2 

<T,b 


(420 




1) 


k < L. 


jqkik^ - 1) + 3rk{n - q){k ~ r) + r(r* 

\ 3NL{k - 1) 

It is noteworthy that the last two terms in the numerator inside the curly 
bracket vanish whenever L is exactly divisible by k. Further, the second term is 
of order nk = N and, when present, exerts a much greater weight than the first 
term. Thus takes a sudden dip whenever Z/ is a multiple of k. In fact, for 
L = qk, (420 reduces to 


(43) 


= ^.Vi _ (fc + 1) 
ft V 


ic) 3N 


L = qk, 
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so that the variance goes to zero if N is sufficiently large. By comparifton with 
formula (39) for <r^i we see that when L = gk the relative efficiency of systematic 
to stratified random sampling is N/L, which increases beyond bound if N ia 
sufficiently large. In intermediate cases, when the remainder r does not vanish, 
the leading term in the relative efficiency for N large is (k^ — l)/Zr{k ~ r). 
This varies somewhat irregularly, depending on the relation between L and k, 

To illustrate, numerical values are given below when L 10 and the finite 
population is large enough so that terms in 1/n are negligible. 

The quantities v,t, v,^ are the corresponding variances apart from a factor 
o-^/N. The stratified sample variance decreases steadily with inertawing per¬ 
centage sampled. On the other hand the systematic sample variance gets* to 
zero and the relative efficiency to infinity when fc is 2, 6 or 10. Moreover, in the 
intermediate cases fc = 3, 4, 6, 7, 8, 9, the variance and the relative effieiency 
show no consistent relation to the percentage sampled. For samples of Ims than 
10 per cent, including the cases outside the limits of the table, the relative 
efficiency decreases steadily from 4 at A; = 11 to 1 when k is large. 


TABLE 1 

Variances except for a factor <r^/N and relative efficiency for systematic and stratified 
random samples for a linear correlogram 


2 

3 

4 

6 

6 

7 

8 

9 

10 

50 

33 

26 

20 

17 

14 

12 

11 

10 

.10 

.27 

.50 

.80 

1.17 

1.60 

2.10 

2.67 

3.30 

0 

.20 

.40 

0 

.80 

1.20 

1.20 

.80 

0 

00 

1.33 

1.25 

00 

1.46 

1.33 

1.75 

3.33 

00 



12. Exponential correlograms. For the exponential />„ = the results are 
much more regular Each of the linear functions of the p’s consists of a finite 
number of terms of an expansion of the form (1 — x)“^ If 

f441 /(if, X)_- W + 


N(N - 1) 


- 1 )» 


which is the sum for al, we find 


(45) 

2 

(Tr 

n\ k. 

)n-m, X)) 

(46) 

2 

0-st 

= —(l -- 
n\ k, 

)|1 -/(A,X)) 

(47) 

2 

CTay 





n \ 

n {k - 1) 


-(^V(n,AX) 
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It may be shown that the variance of the systematic sample decreases steadily 
and its efficiency relative to stratified sampling increases steadily as the sample 
becomes larger. 

In order to obtain some idea of the magnitude of the gain in efficiency, consider 
the case where fc and ti are large. For this case the relative efficiency, which 
actually is a function of k, n and X, turns out to depend almost entirely on the 
single quantity (fcX); or, equally, on the correlation e between the items in 
successive strata in the systematic sample. If t = (fcX), we obtain a, = a /n, 


(48) 


(49) 


2 

<r,i 


n 


n 


- l-T+r,- 


i ' <2 


2e“ 


1 _2 , _ 

^ t + (e' - l)i • 


\ 


The relative efficiency is given in Table 2 for a selection of values of e~\ the 
correlation between the items in successive strata. 

The relative efficiency has a limiting value 2 when p tends to 1 and decreases 
slowly towards 1 as p falls to zero. The gams in efficiency are quite substantial 
if p exceeds 0.1. 


TABLE 2 


Relalive efficiency of systematic and stratified random samples for an exponential 

correlogram 





n 

m 

n 

WL 

.3 


D 


■^1 

m 


m 

DB 

Di 

1.56 

DQ 

m 


It was pointed out in section. 1 that no unbiased estimate of error is available 
from a single sample for either the systematic or the stratified random sample. 
This does not mean that no estimate of error can be attempted. However, any 
estimate must depend on certain assumptions about the form of the population 
which is being sampled and is likely to be vitiated insofar as these assumptions 
are false. If, for instance, the correlogram were assumed to be exponential, 
formula (47), or (49) in the particular case with n, k large, would appear to be 
the appropriate basis for the estimation of error from a single systematic sample. 
Consider the simpler case in which (49) is valid. The correlation between 
successive items in the systematic .sample provides an estimate of e~' and hence 
of I Also, if terms in l/n are negligible, the mean square within the systematic 
sample is found to be an unbiased estimate of tr*. By substitution in (49) a 
consistent estimate of the variance of a single systematic sample would be secured, 
provided that the exponential assumption were correct. The gains in efficiency 
over stratified and random sampling could also be estimated. 
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OPERATING CHARACTERISTICS FOR THE COMMON STATISTICAL 
TESTS OF SIGNIFICANCE 

By Charles D, Ferris, Frank E Grubbs, Chalmers L, Weaver 
Ballistic Research Laboratory, Aberdeen Proving Ground 

1. Summaiy. Methods making possible quick calculation of operating char¬ 
acteristics or power curves of common tests of significance involving the x^, 
F, t, and normal distributions are presented. In addition, a comprehensive sot 
of curves illustrating graphically the power of each test for the 5% significance 
level are included We are interested in the pmver of; (1) the x“-test to deter¬ 
mine whether an unknown population standard deviation ia greater or less than a 
standard value. (2) the F test to determine whether one unknown population 
standard deviation is greater than another (one-sided alternative), and (3) the 
i-test and normal test to determine whether an unknown population mean 
differs from a standard or two unknown population means differ from each other. 
Such operatmg characteristics have application for the quality control engineer 
and statistician in the design of sampling inspection plans using variables whore 
they may be used to determme the sample size that will guarantee a specified 
consumer’s and producer’s risk. On the other hand they are of use in displaying 
the power of a test if the sample size has already been set. Finally, they are a 
necessary adjunct to the proper interpretation of the common tests of significance, 

2. Introduction. In the application of the common statistical testa of sig¬ 
nificance there has been a great need for readily accessible information on the 
power of the test employed to distinguish between the null hypothesis and perti¬ 
nent alternative hypotheses for given sample size. In this connection, two im¬ 
portant applications arise. On one hand it becomes important for tho sampler 
to know, for a given sample size and critical region, something about the power 
of the test m rejecting the stated hypothesis when some alternative hypothesis ia 
true. On the other hand, if the sampler wants a given degree of assurance in 
rejecting the null hypothesis when a particular alternative is true, he would like 
to know the irunimum sample size which would accomplish this when tho prob¬ 
ability of rejecting the null hypothesis when true is given. In particular, the 
need for such information arises most frequently in sottmg sample sizes to dis¬ 
tinguish effectively, on the basis of single sample results, between (1) population 
standard deviations and (2) population means. If the sample size has already 
been set, as is the case with most specifications, quick information on whether 
or not it is large enough to keep the risk of accepting poor material do^vn to a 
reasonable figure is highly desirable. Such probabilities will be recognized, of 
course, as the Type I and Type II errors of the Neyman-Pearson theory. Such 
risks must be given proper consideration in the interpretation of a significance 
test or in designing the provisions of an acceptance test. 
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Needless to say, the appropriate expressions for the power functions of the 
X -test, F-test, normal-test, and i!-test have been derived at one time or another 
in the literatuer. However, msofar as the practical statistician or quality con¬ 
trol engineer is concerned, such information has not been employed to advantage 
widely since no informative graphs or extensive tables of power functions for the 
common statistical tests of significance have been presented. Due to the prac¬ 
tical importance of questions of this type, the authors believe there is need for 
operating characteristics or graphical power functions of the common statistical 
tests of significance. This paper supplies such a need over a useful range of 
sample sizes and alternative hynotheses for the 5% significance level. 


3. Definitions. In the following account, we will refer to one or both of the 
normal populations, iri and xj. We will let ii be a variate from xi, whose expected 
value or mean is mi and standard deviation <ri. By rii we will mean the number 
of observations drawn at random from xi and our sample statistics will be 
defined in the usual fashion: 

Tf I , ni 

£i = S xi/ni, Si = 2 (xi ~ xiffiui — 1). 

1 1 

Similar definitions apply to the normal population X 2 with the appropriate 
subscript for sample statistics and population values. In dealing with a single 
population we will drop the subscripts from the sample statistics. 

We also define 


<r = a standard or arbitrary value of the standard deviation, 
0 = a standard or given level, 

(xi — Xi)“ + £ (l2 —Xif 


J _ 1 
Sl2 — — 


Til -|- ^2 ”” 2 


when two normal populations 
are encountered. 


Ho will be used to denote the null hypothesis and Hi any one of a set of alter¬ 
native hypotheses. The probability of rejecting the null hypothesis Ho when 
it is true (Type I error) will be denoted by a, and the probability of accepting the 
null hypothesis when some alternative hypothesis Hi is true (Type II error) 
will be denoted by /3. 


4. Power function of the x*-f®st. The statistic x* = -- (dropping 

subscripts of sample statistics) is used to accept or reject the hypothesis that the 
standard deviation, ci, of the normal population sampled is some specified or 
given value, <r. 

Our hypotheses are 


Ho’. <Ti = a 

Hi’, ffi = Xo-, (X > 0). 
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A. To determine whether or not a > a. We choose a significance level, a, 
and compute % = ^ x«, where the percentage point a is 

determined by 


( 1 ) 





du = a 


we reject Ho and conclude that <n > a. 

To set up the power function we note that; 
If Ho is true 


Pr 


(n — l)a* 


> X. 


= a 


If Hi is true 
Ft 

However, since 


(n - l)s\ ^ ^ 5 




Pr 


> X« = 1 - /3, 


(n — l)a* .. j \ 1 

o-f 


(1 - |8 « a, if \ = J). 


or 

~ > X*xi-^} = 1 - ^ 

we have the relation 

\\U = xi or X = 

Therefore, for a given significance level, a (Type I error), and various Type II 
errors, /3, we can make use of the Tables of Percentage Points of the x*-<iistribu- 
tion [1] and compute enough of the points (X, /?) to plot the power curves de¬ 
picted in Fig. 1. The Type I error, a, has been set at the practical level of .05 
for Fig. 1. 

B. To detect o-i < V. We compute 

1 ^ (n - 1 ) 3 ’* 

X n 

and if x < Xi-a we reject- Hq , concluding that <ri < a. 

By reasoning similar to that in A. we arrive at the relationsliip 



Again, by use of the Table of Percentage Points of the x'-Distribution the operat¬ 
ing characteristics of Fig. 2 are obtained. We have chosen the practical level of 
« = .05 for Fig. 2. 
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Exam-ple-} A Rifle Association is purchasing small arms ammunition for 
match purposes It is the desire of the rifle club that the dispersion in muzzle 
velocity of a lot of ammunition intended for match purposes be kept down to a 
practical minimum. Acceptance or rejection of an ammunition lot must, of 
course, be made on a samplmg basis since the ballistic acceptance test is de¬ 
structive m nature Moreover, for practical reasons acceptance of a given lot 
is to be on the basis of a single sample. The Association specifies that they are 
not willing to accept more than 5% of the lots whose standard deviation m 
muzzle velocity is 6 ft./sec. The ammunition manufacturer agrees that he will 
accept these terms provided not more than 5% of the lots whose standard devia¬ 
tion in muzzle velocity is 4 ft./sec will be rejected. Under these agreements, 
it is desired to know what sample size is necessary to provide the stated assur¬ 
ances for the Rifle Association and the ammunition manufacturer. 

In this problem, a = .05, /3 = .05, and X = 1.5. Referring to Fig. 1, we 
find the required sample size is approximately 35. 

On the other hand, if a sample size had already been set, the appropriate 
curve in Pig. 1 could be examined to determine whether it provided sufficient 
protection against the acceptance of inferior ammunition. 


6 . Power function of the R-test. In discussing the power function of the 
F-test we will focus our attention on the problem of comparing the standard 
deviations of two normal populations. 

A. To determine whether or not the standard deviation, cri, of one normal 
population is greater than the standard deviation, trj, of another normal popula¬ 
tion. We choose a significance level, a, and compute F = sl/sl . It F > F„ 
where the percentage pomt F, is determined by ’ 


( 2 ) 


r[a(n.i fla — 2 )] 

r[Mni - l)]r[Kn2 - 1)] 


(n, - (ns - 



u' 




[(ni — l)n -f n* — "" 


a, 


we conclude that <xi > <ri. 
Our hypotheses are 


Ho', ffi = ffs 

Hi: <ri ~ Xffs, (X > 1). 

To set up the power function of the F-test we note that: 
If Ho is true 


Pr{sl/sl > F.) a, 


' This example is used to illustrate the use of the power of the 
cated as a most powerful sampling technique (See ref. [10]). 


x*-teat and 


IB not ad VO- 
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If Hi is true 

Pr\8\/a\ > Fa) = 1 - (1 - /3 = a if X = I). 

However, since 

or 

Pr(s?/S2 > X'F,-;.) = 1 - |9, 

we have the relation = Fa or X = 

Therefore, for a given Type I error, a, and various Type II errors, we can 
make use of the Table of Percentage Points of the F-Distribution [2] and com¬ 
pute sufficient points (X, d) to plot the operating characteristics depicted in 
Figs. 3, 4, and 5 In these figures, a has been set at the practical level of ,05. 

It should be emphasized that the operating characteristics presented in this 
paper are applicable only when one is interested in the one-sided alternative that 
ai > and not in < . Under these circumstances, the exact formation of the 

F ratio will be set beforehand and will not depend upon test results (for example, 
placing the greatest mean square in the numerator). In those cases where one 
is interested in the two-sided alternative, a two-tail F-test such as described by 
H. Scheff4 [3] should be used. It is hoped that at a later date operating char¬ 
acteristics of such a test calculated in a manner similar to the example in [3] 
will be presented, 

Example; It became necessary for a noanufacturer to make a choice between 
a new type casting and one produced under standard design practices. One of 
the bases of comparison was dispersion in tensile strength. It was considered 
that if the standard deviation of the standard casting were larger than the new 
type, definite preference should be given to the latter. When the question of a 
practical criterion for rejecting the standard casting was considered, it was 
decided that if its true standard deviation m tensile strength were actually 
times that of the new type there should be a 90% chance of rejection. It would 
be of little practical importance to detect any ratio less than in this particular 
case. It was also decided that the 5% significance level would suffice insofar 
as rejection of equal quality -was concerned. A preliminary sample size of 20 
was selected, and the question arose as to how well a sample of this size gave the 
protection desired. 

The question can be answered immediately by reference to Fig. 3 (hero si 
is computed from the standard casting data, of course) where it is seen that a 
sample size of 20 will fail to detect the stated difference 47% of the time. In 
order to achieve the desired protection, it is seen at once from Fig. 3 that a 
sample size of over 50 -will be necessary. The exact sample size, determined 
with the aid of the formulas above, is found to be 54. 
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Fig. 5. Operating Characteristics of the F-Tbst F = -^ for Testing o-j — irj against tri > o-j 
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B. Analysis of variance, We shall consider the analysis of variance layout 
where a sample of n items is drawn from each of vi normal populations with 
common variance (r^ It is required to decide on the basis of the sample results 
whether or not there is any variation among the true means of the m normal 
populations sampled. 

Let X,, he the jth item drawn at random from the ith population, 

1 ” 1 
a, = - zZ > and a; = — /is.. 

M mfZi 

The l^'-test utilises the comparison of the variation among the sample means 
(external variance) with that among the items within the samples (internal 
variance) in order to teat the equality of population means by making use of the 
ratio 


n £ ~ xYm(n — 1) 

S (a;., — x,f(rn ~ 1) 

If F > Fa, where F« is defined as in 5.A, we conclude that the population 
means are not equal. 

In our approach we will assume that the m true lot means represent a sample 
from a super-population, also normal, with variance equal to Since the 
sampling variance of the means is o^/n, the total variance among the sample 
means equals 

<r^/n + (X* = 1 -|- nO^). 

Hence, our hypotheses are 


m-.B = 0 

Hi: e > 0. 

Since F/x“ follows the F-distribution with m — 1 and min — 1) degrees of 
freedom the operatmg characteristic, i.e. the probability for various 0 of accept¬ 
ing Ht , may be obtained from the curves already graphed by setting ni = m, 
ni = nm — m -f 1, and X^ = 1 + nO^. 

In the design of experiments when the numToer of populations is indefinite 
(for example, daily tests) and the total sample size mn is limited, the above 
procedure will enable one to determine what values of m and n give the most 
'powerful operatmg characteristic for the given amount of sampling. For 
example, for mn = 24 operating characteristics for all possible pairings were 
computed and charted. They were observed to cross one another, each combi¬ 
nation in turn becoming most powerful for a given mterval of &. The following 
table gives the best pairings for various intervals of 0: 
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m 

2 

3 

4 
6 
8 

12 


n 

12 

8 

6 

4 

3 

2 


e 

00- 32 
.32- 60 
.60- .91 
91-1.37 
1.37-2.50 
2.50- 


In contrast to the above discussion, mention should be made of P. C. Tang’s 
approach [4] to the power function of the analysis of variance. The basic differ¬ 
ence lies in the method of expressing the alternative hypothesis. Tang expresses 
it in terms of the variance of a finite number of population means We express 
it in terms of normally distributed population means. We believe our approach 
has considerable practical value in control chart analyses where we are interested 
in the quality of the flow of production of a large number of lots. In addition, 
our approach obviates the difficulties imposed by the non-central x^-distribution. 


6. Power function of the normal test. 


A. The statistic u = 


■\/nix — a) 
<ri 


is used to accept or reject the hypothesis 


that the mean, n, of the normal population sampled, is some specified standard 
level, a, when the population standard deviation is known (for example, from 
past data). 

Our hypotheses are 


Ha’, n = a 

Hi; I M — a 1 = , (X > 0). 

To test the hypothesis m = a, we choose a significance level, a, and compute u. 
If I M I > Wa , where the percentage point, u„ , is determined by 


(3) 


1 


"\/ 2ir 


f +“a 

Un 




dx = I - a, 


we reject Ho and conclude that n a. 

To set up the power function we note that; 
If Ho is true 


Pr{ —Ua < u < H-Ua} = 1 — a 

If Hi IS true 

Pr|-Ua < ^ < u„| = /3, (1 — ,3 = a if X = 0), 

= Pr|—Ua + X Vn < - — < Uff + X Val 

I M — Q I 


where X = 
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In the latter expression the statistic 


- — IS normally distributed with 

Cl 


zero mean and unit variance. The requhed probabilities are found easily from 
tables of areas under the normal frequency curve. By computing enough 
points (X, ;8) the operatmg characteristics depicted in Fig. 6 were constructed. 

It should be noted that the /3 correspondmg to a pair of values n' and X' may 
be obtained from any other operatmg characteristic by use of the relation X = 
X'y/ n'/n For example, if it is desired to find the Type II error for a sample 
size of n' = 12 and X' = 1, select any operating characteristic, say for n = 3, 
as the reference curve, compute X = 1\/12/3 = 2, and find frorri the curve for 
n = 3 that /3 = .07 In Fig. 6, however, individual operatmg characteristics 
are plotted for convenience and to provide a picture of the comparative effi¬ 
ciency of various sample sizes. 

Example. Pressure-measuring instruments are being tested against a standard 
level It has been decided that instruments whose true mean reading is as 
much as 10 pounds per square mch away from the standard level should be 
rejected 95% of the time. On the other hand only 5% of instruments whose 
true mean readmg equals that of the standard should be rejected From past 
data, it is known that all test instruments of the type being considered have a 
stable standard deviation of 5 psi. If rejection or acceptance is to occur on the 
basis of a single sample and the normal criterion of significance, what sample 
size should be chosen to accomplish this purpose? Referring to Fig. 6 with X = 
10/5 = 2 it is seen that a sample size of 4 provides the required assurance. 

B. In sampling two normal populations n and ttz , the statistic 


a/ < j \ ln \ -f- ctz/hz 

is used to accept or reject the hypothesis that = Mz ■ For generality it will be 
assumed that the population standard deviations o-: and crj may not be equal, 
although they are known accurately. 

Our hypotheses are 

Ha- = m 

Hi: I w — in 1 = Xffi. 

Significance is determined in the same manner as in 5.A., and the power 
function is set up in identical fashion. The value (3 is found to be the area 
under the standardized normal curve between the abscissas. 

V k^ni + Hi 


where <12 = kai. The value of /? may easily be read from Fig. 6 for any X', ni , 
n %, and k by selecting the curve for a convenient sample size, n, on Fig. 6 and 
taking 




W] rii 


k^ni -h 712' 
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7. Power function of the <-test. 

A. The statistic i = - — is used to accept or reject the hypothesis that 

£ 

the mean, n, of the normal population sampled, is equal to some specified level, 
a, when the population standard deviation, vi, is unknown. 

Our hypotheses are 

Ha: H = a 


ifi: I — a I = Affi, (X > 0). 

In order to test the hypothesis n = ay/t choose a significance level, a, and com¬ 
pute the statistic t = —— -^. If ] 11 > t* , where the percentage point, 

s 

t„, IS determined by 

r[i(n - IjlVn-lVir i-ia V — V ^ 


we reject Ha and conclude that p a. 

To set up the power function we note that: 
If Ha is true 


Pr(-ia < t < +{al = 1 - a. 


If Hi is true, 


Pr{-«, <«</„!= d, 
However, we have the identity 

L <^1 Oi. 


(1 - |3 = if X = 0). 


— + XV= Pr (— fa < i < +/„} 


where X = 



Hence, for any fixed —, the above probability may be 
0\ 


denoted by say h{s/a{) or, using the notation of section 



evaluated as the area under the standardized normal curve between the abscissas 
indicated. Then 


where /(x^) is the probabdity density function of x* for 7i — 1 degrees of freedom. 
This is one method of evaluating ^and it was used for calculating the operating 
characteristics for a < 5. 

It has been noted that such a formula had been employed by Neyman and 
Tokarska [6] in calculating Type II errors where only one tail of the <-curve is 
used as the region of rejection. Probabilities calculated in this manner are 
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provided by Neyman and Tokarska for degrees of freedom n = 1 to 30 and Type 
I errors of 01 and .05. As soon as the area in one tail of the non-central f-dis- 
tribution becomes negligible these curves are equivalent to the test treated 
herein ivith an a of .02 and 10 respectively. An idea of the critical values of X 
at which this occurs may be obtained from a table in a succeeding footnote in 
which they are quoted for a = .05. The values are surprisingly small, such that 
almost all of Neyman’s figures can be interpreted for a two-tail region of re¬ 
jection. 

Using C. C. Craig’s development of the non-central t [7] we obtain'* 


& = Pr 


~t„ < + VnX ^ 


s/a\ 




^ r! _ 


(r + l/2),Kn- 1); 


n — 1 




where I{'p, q-;c) represents the Incomplete-Beta Function Ratio [7]. This may 
be conveniently used for those values of n where the necessary values are obtain¬ 
able from Tables of the Incomplete-Beta Function ratio [8] and for small values 
of X where the above series converges rapidly. 

The method actually used for n > 4, however, made use of the tables pre¬ 
pared by Johnson and Welch [9]. Replacing their X by x to avoid confusion 
with our notation, these tables give values of x tabulated against f, t, and e such 
that 


Ft 


t = 


Z + 5 
y/ w 





where a is a normally distributed variate with zero mean and unit variance, fw 
is distributed according to the x*-distribution with / degrees of freedom, and 
d = ta — xV 1 4- tl/2f. We want 


^ = 1 — Pr[i < —<a) — Pr(< > ia}. 

For those values of X and n for which Pr{< < —ta] is negligible’ we can, for 
any given e, take k = ta and / = n — 1 and read x from the tables, then deter- 


It should be noted that Craig’s formula as published is in error in having i(r -f- 1) as 
the parameter in the mcomplete beta function instead of r -t- i. 

* Values of for which Pr{t < —{ ojl = 005 are listed below, 

/ = n — 1 \ 


4 

.34 

5 

.30 

6 

.27 

7 

.25 

8 

.23 

9 

216 

16 

.159 

36 

103 

144 

.051 

00 

.000 
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mine 5 and finally X from the relation X = 6/\/n After computing /3 = 1 — e, 
the point (X, /3) on the operating characteristic may be graphed. At the ferv 
places Avheie Pr{i < —is not negligible and /3 is needed n:e can for a given X 
take 

_ <0 - S 

“ vi + tim 

and then by reading tt for various values of <, /, io make an inverse interpolation 
for ethus setting values for Pr{i > —/«) and Pr{i > <a). Pinally 

/3 = Prji > —ta\ — Prjt > +ta}. 

It was found that for n > 10 a good approximation for computing operating 
characteristics is given by 

^ = Pr { — \'\/11 < t ia 4“ X's/7i} 

in -which the variable t is distributed as central t with n — 1 degrees of freedom. 
This formula proved to be quite useful in preparation of the operating character¬ 
istics for the t-test. 

Fig 7 presents operating characteristics of the i-test calculated by these 
methods. It should be noted that m using the i-test, alternative hypotheses 
are expressed as so many multiples of the unknown population standaid devia¬ 
tion a-way from the level stated in the null hypothesis In some applications 
the alternatives may be naturally so expressed. In many applications, how'- 
ever, it may be desired to control the distance p — a regardless of the stand¬ 
ard deviation of the lot sampled. In this case, one could place confidence limits 
on the estimate of v, determine the X value corresponding to each estimate, and 
finally obtain limits on the sample sizes or risks involved * 

B. For the case of tivo normal populations, the statistic 

^_ ^1 — X2 

Si2'\/llni l/rh 

is used to accept or reject the hypothesis that = lij -when the two normal 
population standard deviations are unkno-wn but equal to say, ci. 

Our hypotheses are 

Ho: 111 = in 

Hi l \ Hi — H2 \ = X(7l . 

Significance is determined m the same manner as in par 6.A., and, by reason¬ 
ing similar to that in the preceding section, it is found that /3 for a given X' can 
be read from Fig. 7 by takmg 

X = ^ 

_ Vn y ni + m 

^ For a test of this nature in which the power of the test depends only on the absolute 
value of the distance n — o see [10] 




„ r -\/n{x — a)"] _ 

Fig. 7. Operating Characteristics op the i-TssT 1 t —- I for Testing li — a against y . ^ a 
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and n = ni + 71 } — 1. Before a statistical test of this nature is applied the data 
should be examined to verify consistency with the assumption that ci = a 

Example: An analysis of the difference in tensile strength between two types 
of castmgs is bemg conducted. A sample of 10 items is selected from each type 
of casting and the i-test employed to establish superiority of one over the other. 
Experience has shown that the variability in tensile strength for one type of 
casting is comparable to that of the other type. If a is set equal to .05, what 
percentage of the time would our significance test fail to detect a superiority of 
one standard deviation in tensile strength? n = 10 + 10 — 1 = 19 and X = 
,513. Referring to Fig. 7 for this X and n, it is seen that the percentage /3 is 
approximately 45. 

In this paper we have presented power curves or operating characteristics of 
the common significance tests employed but a single sample of items. The 
power of the tests obtained here does not represent the limit that can be obtamed 
for the average amount of inspection performed, say, over many consecutive 
lots. Tests, sequential in character [11], have been shown to be much more 
efficient. Nevertheless, single sampling is often the only practical procedure 
available. Again, the data may be brought to the analyst as single sample 
results collected supplementary to other purposes or prescribed by a standard 
procedure. Finally, in performing a significance test, it is quite important to be 
able to give constructive advice when the data indicate practical differences 
although no statistical significance is found ‘ 

Although sequential tests using variables have been devised, no investigation 
of double sampling schemes for variables similar to the Dodge-Romig [12] 
plans for attributes has, as yet, been designed with the exception of [9]. It is 
believed, however, that such plans would have considerable application for 
mdustry in combmmg efficiency with practicability. 

The graphs of the operating characteristics in this report have been made by 
calculating a sufficient number of points to draw them m by use of French curves. 
Considering this method of plotting slight error should be allowed for in reading 
probabilities of acceptance from the graphs, especially where the curves are 
steen. 
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MINIMAL VARIANCE AND ITS RELATION TO EFFICIENT 
MOMENT TESTS 

By J. R. Vatnsdal 
fState College of Washington 

1, Siunmaiy. When, a curve is fitted to a set of data by moments, the usual 
procedure used in testing the hypothesis that the population is of the given form 
with the parameters as computed from the moments is to compare the higher 
moments with their expected values as determined by the hypothesis Gen¬ 
erally speaking, moments about the mean are computed although the reason for 
this is not clear. To shed some light on this question, the sample given in the 
mtroduction is fitted to two curves. Moments about various points are com¬ 
pared with their expected values and the discrepancy in standard units ex¬ 
amined. This discrepancy is found to vary widely and to have a maximum. 
The notion of equivalent moment tests is introduced, and on this basis the most 
efficient moment test is defined m such a way that of all equivalent moment 
tests, this one is most likely to reject a false hypothesis. 

For any moment it is shown that there is a point about which its variance is a 
minimum. The conditions are found which determine the position of this point 
for second and third moments. It,is proved that for symmetrical populations 
the variance is minimal when the moments are computed about the mean of the 
population. If the population is an asymmetrical Pearson frequency function, 
it is proved that the point about which the third moment variance is minimal 
differs more from the mean than does the correspondmg point for second mo¬ 
ments. The condition is pointed out for which this is true in the general case. 

The third and fourth standard semi-mvariants of second moments of minimal 
variande are computed and compared to those of the second moment about the 
mean. The ratios of these are displayed for some populations to illustrate how 
this may be used to investigate when the approach to normality is more rapid 
in one case than m the other. Some examples are presented to contrast these 
and other tests. 

2. Introduction. In testing the hypothesis that a given set of observations 
is a random sample from a completely specified population (either a priori or 
specified by a consideration of the sample), generally the Chi-square test is 
applied or certain functions of the moments are compared with their expected 
values and the significance of their departure as determined by the hypothesis 
is examined. 

In the Neyman-Pearson theory it is required that the functional form be 
known. The hypothesis then is some statement concerning the parameters. 
The main principle there used is that the test used should be such that, while 
keeping the probability of rejecting the hypothesis ivhen true at a certain sig- 
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nificance level, it will mmimize the chance of accepting the hypothesis when 
some alternative is true. 

However, if the functional form is regarded as unknown, the alternative hypoth¬ 
eses are then usually unknown. The test then must be one that does not 
depend on alternatives In the light of incomplete knowledge of the distribu¬ 
tion of sample statistics, and since moments of moments are practically the 
only ones known, we shah here use the principle of comparing observed moments 
with their expected values. It is known that the distribution of moments in 
large samples is asymptotic to the normal distribution if the appropriate mo¬ 
ments of the population exist [1]. Here we shall confine ourselves to such 
populations and large samples. 

To introduce the idea which underlies the theory here presented, consider a 
simple example. Suppose a sample is given and the hypothesis is of the form 
f(x, d) with 8 = 6a. Furthermore, suppose the first moment of the sample is 
equal to its expected value. If a second-moment test is used, this means that 
one computes the arithmetic mean of the squares of the deviations of the elements 
of the sample about some pomt, and compares this with the theoretical moment 
about the same point. Generally speaking, the pomt used is the mean of the 
population or the mean of the sample. However, the pomt may be chosen in 
any manner. For each such choice a test can be devised such that the prob¬ 
ability of rejecting the hypothesis when true is e. All such tests are called equiv¬ 
alent moment tests. Among these equivalent moment tests, one particular 
second-moment will have the minimal variance. This one is here called the 
most efficient moment test. 

This test has the property that the range of values of the second moment for 
which the hypothesis is accepted is as small as possible. Thus of all equivalent 
second-moment tests, this one is most likely to reject a false hypothesis. 

This idea may be easily extended to moments of higher order, in all of which 
the concept of minimal variance is fundamental. The point of view may be 
taken that the point about which the moments are computed should be such 
that the variance is a mmunumj or what is equivalent, the variance of moments 
about the origin is minimized by choosing the origin properly 

An example is here presented to bring this out more clearly. A sample of 
1,000 items is given and fitted by the first two moments to two different fre¬ 
quency functions (The sample items are not given here; they are to be found 
in Tables for Statisticians [2]). The third and fourth moments have been 
computed and the discrepancies in standard units as determined by the 
hypotheses are exhibited in a table. 

This sample of 1,000 items considered as a sample from an infinite population 
has these moments: 

m[ = 139.288 
rn’i = 19692.452 
ms = 2827467 388 
m[ = 412561061.04 
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By fitting the first two moments of the sample to curve A, 

B+l 

y = _ - _ a-" 

^ r(n + 1) ^ " 

we get a = 0.4781516735 and n = 65.60079029; to curve B, 


y 


1 


<rV^ 

we get n = 139.288 and <r® = 291 305056. 

The discrepancy between the observed and theoretical rth moment about any 
point is measured by 


i = 


It 

nir 


ft 

Mr 




— Mr 


n 


in which nir is the rth moment of the sample of n about this point, and ti" is 
the rth moment of the population about the same point. 

The values of | i | have been computed corresponding to various points for the 
third and fourth moments. These are exhibited in four tables, given below. 

Examination of the table for the discrepancy between the observed and theo¬ 
retical third momenta for curve B, shows that when this moment is computed 
about 1 = 0, the hypothesis is accepted at the 1% level, this is also true for * 
= 39.3, but for x = 139.3 the hypothesis would be rejected at that level. It 
is evident that some rule must be established to decide what point is to be used 
to make the test. 

If the curve is fitted by the first tAvo moments the value m,' - is the same 
for every point. This is easily demonstrated, for if mt and p's' are measured 
about a point h units to the right of the origin, m" = - Shm'i -f - A’ 

Md M3 - 3 Am2 + 3Am; - A^ Now, m'^ = m', and m[ = mi'. It follows 
tnat wa — fn = 

The maximum value of ] «] is attained when the variance of third moments is 
a minimum. In this manner it is assured that the range of values for which 
the third moment is accepted shall be a minimum. 

If the third moments agree, or the agreement is sufficiently close such that the 
ypothesis cannot be rejected, 7714 - M 4 is constant or varies only slightly from 
pomt to pomt, so that minimizmg the variance yields the maximum value of 
s is seen from the tables above, when the moments are compared at the dif¬ 
ferent pomts, the hypothesis may be accepted for one point and rejected for 

tTthe point which yields the minimal variance, 
the hypothesis will be rejected more often than for other points. Thus, of all 
equivalent moment tests, this one is most likely to reject a false hypothesis 
oh J /ietermmmg for various moments how the origin may be 

^ variance of the distribution of these moments shall be a 
mmimum is now considered. 
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3. First moments. In the case of the first moment, whose expected value is 

the mean of the population, the variance is given by It is obvious 

n 

that the choice of origin does not affect the variance of the first moment, since 
it is well known that — n'l is mvariant with respect to choice of origin. 

4. Minimal variance of second moments. The variance of second moments 
about an arbitrary origin is -(ni — n'^). Expressed in terms of n[ and central 


TABLES 


Curve A. 


Third moments 

Fourth moments 

Point 

t 

Point 


0 

.0365 

0 

.197 

50 

.084 

50 

,697 

100 

.33 

100 

4.74 

120 

.77 

120 

14.17 

130 

1.28 

130 

26.76 

140 

1.91 

140 

49.03 

142 

1.95 

145 

45.26 

145 

1.90 

150 

42.89 

150 

1.60 

160 

21.31 

160 

.95 

180 

6.25 

170 

.57 

200 

2.51 

180 

.37 

300 

.183 

200 

.18 




Curve B. 


Third momenta 

Fourth, moments 

Point 

i 

Point 

t 

0 

085 

0 

.02 

39.3 

.19 

39.3 

.13 

89.3 

.69 

99 3 

,88 

109.3 

1 16 

109.3 

1.09 

119.3 

2.39 

119.3 

2.00 

129.3 

4.05 

129.3 

3.18 

139.3 

5.57 

133.3 

3.83 

149.3 

4.05 

135.3 

3.96 

159.3 

2.39 

137.3 

3.93 

169.3 

1.16 

139.3 

3.67 

179.3 

.98 

140.3 

3.46 

189.3 

.69 

143 3 

2.72 

199.3 

.50 

148.3 

1.59 

209.3 

.38 

159.3 

39 

239.3 

.19 

179.3 

.13 



239.3 

.07 


moments, this may be written 

(1) /I2(m0 = + 4/X2 Mi”)- 

n 


Here it is evident that the variance of second moments does depend on the 
choice of origin, and is not invariant under translation. 


The minimum value of is given by /xi = — — and is-lm — — —Y 

2(12 n\ 

Then we may write 


(2) 
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Throughout this paper m* denotes the second moment of the sample about 

an origin chosen such that ^, which is the value of ni which minimizes 

( 1 ); ml denotes the second moment about an origm chosen such that n[ ~ 0) 
mi denotes the second moment about the mean of the sample. It may be noted 
that in large samples the distributions of ml and inz are approximately the same. 

It IS clear from ( 2 ) that if ^13 = 0 , or, if the population is symmetric, i.e. f{—x) 
= f{x), then /li {mt) = fisiml). However, if ps 7 ^ 0 then ixiimt) < Haiml). 


6 . A moment inequality. Since the quantity given by (2) is essentially 
non-negative, an inequality is obtained valid for any distribution in which the 
first four moments exist, viz. 

2 

('3) P) — P2 — — ^ 0, P2 5 ^ 0 

P2 

or m standard moments 

(4) — as 1 > 0 

This is a stronger mequality than tho one given by Beitelsen [3], i.e. al — 
a 4 — 2 < 0 or the one generally Imown, at > al, [4]. This inequality, however, 
was known to K. Peaison [5, p 432], although he derived it from a different 
point of view. 


6 . Minimal variance of higher moments. The variance of the distribution 
of rth moments of random samples about an arbitrary origin always has a 
minimum. The variance of mi is given by 

(5) p2(mr) = - (p^r — pi’*). 

n 

This expression when expanded in powers of p] is always a polynomial of even 
degree with the coefficient of the highest power a positive number. Further¬ 
more, by differentiatmg iiiimi) with respect to pi and equating the derivative to 
zero, the value of pj which minimizes P 2 (wil) will be found among the solutions 
of that equation 

For third moments of samples the variance is given by 

M) = - [p 1 - pfi 

which, when expressed in terms of moments about the mean and powers of the 
mean, becomes 

(6) P2(m3) = -[pe - P3 + 6 (p 6 - P8P2)p( + (16p4 - Qufjni + ISpsp]® + 9P2 Pi^]. 
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Differentiating with respect to /ii and equating to zero, we have 

(7) + (5pi4 — 3 m4)mi + (#is — = 0- 

By straightforward application of the methods of solving cubics, it is easy to 
show by means of (3) that (7) has one real root only, which moreover is ^ 

as 

at — asd^n — fas — 1) | 0. 

Since it can also be shown by means of (3) that the second derivative of ( 6 ) is 
positive, this root of (7) will minimize litimi). 

These facts demonstrate: 

Theorem I. The point about which the anihmehc mean of the cubes of the 
vanates has minimal variance is to the right, at, or to the left of the corresponding 
point for the squares according as 

(8) at — a3(^o!4 — ^0-3 — 1) I 0. 

By examination of (7) it is readily seen that if as = 03 or if the population is 
symmetric, the real root will be zero; so that for such a population the variance 
of third moments is a minimum when moments are taken about the mean of the 
population. If as 7 ^ as the variance of third moments will be a minimum when 
taken about some other point 

For fourth moments of samples the variance is of the sixth degree in fi[ and 
its derivative therefore of the fifth degree. There is not much to be said in a 
general way except that if a? = 0403 or if the population is symmetric, /ii “ 0 
will cause this derivative to vanish. 

If the distribution is a Pearson frequency function, from the recursion formula 
for the moments [ 6 , p 24], 

/2a4 H“ 4 -|- 25'\ 

V - i - 5 ; 

where 


— ^ accordmg 


2a4 — Sas — 6 
a4 + 3 


The criterion ( 8 ) can be written 

/Qs /2a4 + 4 + 25\ , ,3 . 

(9) as I- \ _ Q -) + aj +|a3 — jaias . 

It will now be shown that (9) % 0 according as 03 % 0, since (9) is 03 D where 


( 10 ) 


D = 


204 + 4 + 25 


+ 1 + 5«3 ~ iai 


1-5 
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It suffices to show that Z) > 0 for all Pearson curves. Using the method of 
Lagrange multipliers, it is possible to show that within the permissible range of 
values of the variables involved, the g l.b. of D is and so U > 0. It has been 

proved that the variance of the squares is a minimum when ni = o-. It has 

just been shown that the sign of (9) agrees with that of at. These, together with 
Theorem I, demonstrate 

T 3 E 0 HBM II. For Pearson frequmoy functions, aj 9 ^ 0, the point about which 
the variance of cubes is a minimum deviates more from the mean than does the cor¬ 
responding point for the squares. 


7. Symmetric populations. For the distribution of rth moments of samples 

(11) thfm!,) - - (fl^r — /r^). 

n 


To find the minimum of (11) expand in terms of central moments and powers 
of Ui, differentiate with respect to p'l, and equate to zero. This yields: 


( 12 ) 


(2r - 2)rV,/ir'-" + • • • + 


-E 


r 

0 \i 


K 


+ • • • + 2r(p2f_i — PrPr-l) = 0 . 


For each power of , the coefficient is an isobario moment function and is of 
even weight when the power of is odd, and of odd weight when the power of 
Ui is even. If the population is symmetric the coefficients of even powers will 
vanish as will the constant term. Then ui will be a factor, the other factor 
being a polynomial with only even powers of . In this latter factor, where K 
is even, the coefficient of Kui^~^ is 


(13) (J) 

Since 


(13) may be written 



X 

flWi-X + E bi(nir-K~ n,~tPr~K+i), 


r — i,K even. 


where a, bj are non-negative integers. 

It can be immediately established by use of an inequality due to Tchebyoheff 
|7, pp. 43,168] that naic +21 ^ UiK'Pu and therefore (13) is positive or zero. 

To sutn up, if the odd moments vanish (12) will have a factor ui and a factor 
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which is a polynomial with even powers only of with positive coefficients; 
therefore there is one and only one solution, ni = 0. This establishes 
Theorem III. For a symmetrical population, the distribution of rth moments 
of samples has minimal variance when the origin is the population mean. 


8 . Distribution of second moments. To study in more detail the distribu¬ 
tions of W 2 and ml the higher moments are computed and compared. Applsdng 
the formula for the distribution of rth moments we obtain, for ml 

liiiml) = 

Piiml) = i(fi4 — pi) 
n 


(14) 


etc. 


at 


/ 0\ ae — 3 q!4 + 2 

(rf) - 3 - If y . - ft- + - 3 _ 3 ] 

nL («4 - 1)* J 


For the distribution of mt, we get 
Pi(mt) = IJ 2 + ^ 

piim*) = -(m — pi — —^ 
n \ yi/ 

f *\ "■ + 2 + 3a5 — SastKa -f- Soial 

-- 

ai { m *) — 3 = —[(as — 4a8 6a4 — 3 -|- 12a5 


4 

as 


as 


— 6 a# 


40703 -f- 00603 — 120403 -7- 4 o3 — 40805 

+ 04aJ)(04 — 03 — 1)"““ — 3] 


etc. 


Computing the ratios of 03 's, we have 


(16) 


a3(7n 


osCmS) 
Similarly 

ai(m*) - 


= fi¬ 
ll) L 


03(3(05 — Oi) — 03(804 — 03)1 
06 — 3o 4 -f- 2 


n/ 2 \-»/2 


04 (^ 1 ) ~ 3 

_ [i _ 03(407 + 604O3 4 - 40306 -b 1203 — 12 o 6 — 606O3 — Oa — O3O4) 
L 08 — 4o 6 — So* -{- 12 o4 — 6 


'] 
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It is evident that when as = 0, the ratio in each case is unity. These ratios 
seem too involved to mahe any other general statements, but for particular types 
of populations these ratios in terms of the parameters are considerably simplified. 
To illustrate this statement, consider 


From the foregoing formulas we compute 

= Af + i, Aiml) = M 

, 2M^ ,0, + 

w(wis) = -, = --- 

n 


n 


(18) 

(19) 


Mmt) _ ./2 (2M+ 1 )“'* 

a,(wi?) y M 8Af* + 22Af + 1 

Mmt) - 3 _ (12M' + 36K + 2)(2M + 1)’ 
04(7715) - 3 “ Af(48Af» 4- 384MS + 112M + 1)' 


The minimum value of (18) is 0.71 for = 1.22 and (18) is < 1 for M > 0.31. 
The minimum of (19) is 0.70 and is < 1 for Af > 0.62. For the Poisson dis¬ 
tribution, then, not only is the variance of m* less than that of ml , but at least 
as far as the first four moments are concerned, the distribution of m* approaches 
normality more rapidly than does m\ for all values of Af > 0.62. 


"When one follows the same procedure for 


r(p) 




' it is found that not only 


IS the variance of mj less than that of m \, but as far as the first four moments 
are concerned, the distribution of m* approaches normality more rapidly than 
does ml, for values of p > 0.7. 

In the case of higher moments, it seems desirable to solve the necessary equa¬ 
tions in each particular case, since the equations are somewhat involved. 


9. Examples. A few examples are exhibited to illustrate the foregoing ideas 
and to contrast with some of the other methods. 

1 . A sample of 1,000 is obtained with the following distribution 

i: 0 1 2 3 4 

/: 625 269 91 11 4 

The hypothesis being tested is that the population is/x = ^_ _ ,withAf = 

0 . 6 . 

X = 0.5 and therefore the mean does not differ from its expected value. 

By using the nit test, we compute t = 2.06. If m* is distributed normally, 
the hypothesis is rejected at the 5% level. By using the ml test, we find t = 
1.45, and therefore by this test the hypothesis is not rejected at the 5% level. 
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Applying the x test, we find that the hypothesis is not rejected at the 5% level. 

2. We return now to the sample mentioned in the introduction. 

Since the parameters in population A were found by fitting the first two mo¬ 
ments, the tests will be made on the higher moments. From the definition of 
ml and m* it is clear what is meant by ml , mt, ml and mt. 

Consider the discrepancy of third moments in standard units t as a function 
of h, the distance from the origin. It is easy to see that 

t = (ml - mO/VG. 


where 

C? = - [Me “ Ma* ~ 6 /i(m 6 ~ MaMa) + 3h^(5tn — Sfii — 
n 

— ISA (nz — MaMi) + 91 i*(m 2 — Ml*)]- 

For the ml test, h = 139.288. The value of h which minimizes the variance 
is a solution of 6 (m 2 — — 9(Ma — MaMi)^* + ( 5 m 4 — 3 fi2 - 2n'zn[)h - 

( it's — it'sli'i) = Oj which, for this population is h = 142.66. Using these values 
and computing, we find, for the mS test, t = 1.90 and for the ml test, t = 1.95. 

Using the same methods applied to fourth moment tests, we obtain for the 
ml test, h = 139.288 and t = 48.7, and for the ml test, h = 143.73 and t = 
52.4. 

The X* test cannot be used here since the moments alone are given; further¬ 
more there is some difificulty in interpreting it rmder these conditions. 

In this particular example, the third moment test would not reject the hypoth¬ 
esis at the 1 % level, while the fourth moment test would reject at that level. 

3 . Since population B is symmetric, it is known that the ml and ml tests are 
identical; similarly for ml and ml. For the ml test, t = 5.67, which would 
reject the hypothesis at the 1 % level. The fourth moment test would not be 
applied in practice. 

The writer wishes to acknowledge his indebtedness to Professor P. B. Dwyer 
for counsel and guidance. He also wishes to thank Professors H. C. Carver and 
C. C. Craig for valuable suggestions. 
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TOLERANCE LIMITS FOR A NORMAL DISTRIBUTION* 


By A. Wald and J. Wolfowitz 


Columbia University and University of North Carolina 


Summaiy. The problem of constructing tolerance limits for a normal uni¬ 
verse is considered, The tolerance limits are required to be such that the prob¬ 
ability is equal to a preassigned value j3 that the tolerance limits include at least a 
given proportion 7 of the population. A good approximation to such tolerance 
limits can be obtained as follows: Let & denote the sample mean and s® the sample 
estimate of the variance. Then the approximate tolerance limits are given by 


X 



and 


$ + 



rs 


where n is one less than the number N of observations, x ndenotes the number for 
which the probability that 'with n degrees of freedom will exceed this number i,s 
and r is the root of the equation 


V 2t Jl/y/if-r 


y- 


The number Xn,^ can be obtained from a table of the x* distribution and r can be 
determined with the help of a table of the normal distribution. 


1. Introduction. The problem of setting tolerance limits for a distribution 
on the basis of an observed sample was discussed by S. S. Wilks [1], [ 2 ] and by 
one of the present authors [3], [4]. For a univariate distribution the problem may 
be formulated briefly as follows: Let x be the chance variable under considera¬ 
tion and let ii, ■ ■ • , Kjv be a sample of N independent observations on x, Two 
functions, Li and Li, of the sample are to be constructed such that the probabil¬ 
ity that the limits Li and Lj will include at least a given proportion 7 of the popu¬ 
lation is equal to a preassigned value /3- The limits Li and Lj are called tolerance 
limits. 

The following two cases have been treated in the literature: CD Nothmg is 
known about the distribution of x, except perhaps that it is continuous, or that it 
admits a continuous probability density function. ( 2 ) The functional form of 
the distribution of x is known and only the values of a finite number of parameters 
involved in the dist ribution of x are unknown. We shall refer to ( 1 ) as the non- 

‘ This paper reports work done by the authors in the Statistical Eesearoh Group, Divi- 
sion of War Research, Columbia University, under contract OEMsr-618 with the Applied 
Mathematics Panel, National Defense Research Committee, The work was first reported 
in an unpublished memorandum, “Tolerance Limits for a Normal Distribution” (SRG 
number 392, 3 January 1945) written by the authors, of whom ono was a staff member and 
the other a consultant of the Group. The problem was suggested by W. Allen Wallis on 
the grounds that the limits previously proposed (see [4], section 6) are unsatisfactory for 
moBt practical purposes. 
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parametric case and to ( 2 ) as the parametric case. An exact solution of the 
problem for univariate distributions in the non-parametric case has been given 
by S. S. Wilks [1] His results have been extended to multivariate distributions 
by one of the present authors [3]. An asymptotie solution of the problem in the 
parametric case, which may be used for large samples, was given in [ 4 ].* 

In the present paper we shall deal with the problem of setting tolerance limits 
for a normal distribution with unknown mean and variance. Approximation 
formulas are obtained which differ from the exact values by a magnitude of the 
order \/N^. They give much closer approximations to the exact values than 
those which can l>e obtained by applying the general asymptotic results in [ 4 ] 
to the normal distribution In addition, the approximation formulas in the 
present paper have the advantage of considerable simplicity and can easily be 
computed with the help of tables of the normal and x distributions. To estimate 
the closeness of the approximation of the formulas given in this paper, a method 
of computing upper and lower limits for the exact values has been derived. Com¬ 
putations show that the approximation is good even for small values of N. A few 
numerical examples arc given in section 7. 

2. Precise formulation of the problem and notation. Let xi, • • • , aiy be i\f 
independent observmtions from a normal population with mean n and variance 
a, both unknown. We shall denote by £ the arithmetic mean of the observa¬ 
tions and by i the sample estimate of the population variance a, i.e., 

( 2 . 1 ) 

and 

(2.2) s’ = £- -1 j where n = N — \. 

For any positive X we shall denote by s, X), or more briefly by A, the propor¬ 
tion of the normal universe included between the limits S — Xs and i -f- Xs, i.e., 

A is a chance variable, since the limits of integration are chance variables. In 
this paper we shall deal with the problem of determining the value of X so that 
the probability that A exceeds a preassigned value 7 is equal to a preasaigned 
value /3. The desired tolerance limits will then l)e given by £ — Xs and £ -|- Xs, 
respectively. In practice, the values and 7 will usually be chosen near unity, 
frequently > .96, 


'Although the reeulte obtained in the non-paramotric case could be applied to the 
parametric case as well, it would not be satisfactory to do so, since for the panunetrio case 
methods having greater efficiency eon be devised by taking into aocount the available in¬ 
formation regarding the functional form of the distribution. 
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It can be verified that the distribution of A does not depend on the unknown 
parameters ft and <r. Thus we can assume without loss of generality that /i = 0 
and 0 - = 1 . 

For any given positive value X we shall denote by P( 7 ,X) 'the probability that 
A > y. For a given value 2 we shall denote by P( 7 ,X | x) the conditional prob¬ 
ability that A > 7 under the condition that the sample mean has a given value 
X It IS clear that P( 7 ,X) is equal to the expected value of P(y,\ \ 2), i.e., 

(2.4) P(7,X) = ^ f P(y, X 1«) d 2 . 

V 2ir 

3. Method of computing P( 7 ,X | x) for any given values y,\ and 2, Since A 
= A(i,s,X) is a strictly increasing function of s, the equation in s 

(3.1) A(x,s,X) = 7 
has exactly one root in s. Denote this root by 

(3.2) s = r(x, 7 ,X). 


Thus, r( 5 c, 7 ,X) is that value for which 


(3.3) 


1_ 

V2v 


Tl^) 




dt = 


7. 


It is clear that Xr( 5 , 7 ,X) does not depend on X We shall write 


(3.4) Xr(i, 7 ,X) = r{x,y). 

Obviously r{£,y) is that value for which 


(3.5) 


1 

'\/2v 


I 


*+r(i.r) 




i-r<S,y) 


7. 


For given values of x and y the value r(x, 7 ) can be obtained from a table of the 
normal distribution. 

Since A(x,s,X) is a strictly increasing function of s, the uiequality A(x,s,X) > 
7 IS equivalent to the inequality s > r(x,y,\) = r(x, 7 )/X. Hence, since x and s 
are independently distributed, we have 


(3.6) P( 7 ,X I x) = P(s > r(x,r)/X) 

where P(s > c) denotes the probability that s > c for any constant c. In gen¬ 
ial, for any relation R we shall denote by P(B) the probability that B holds. 
Since ns has the x distribution with n = N — 1 degrees of freedom, we have 

(3.7) p(^s > 

where x„ stands for a random variable which has the x distribution with n 
degrees of freedom. The probability on the right-hand side of (3.7) can be ob¬ 
tained from a table of the x* distribution 
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Hence, we see that the computation of | x) for given values y,\ and x 
can be carried out in two simple steps. First we determine the value of r{x,y) 
from a table of the normal distribution and then read the value of 




from a table of the x distribution. 

4. Proof that the difference p(^y,\ ■" is of the order 1/N\ It 

is clear that P(y,\ ] x) is an even function of x. Hence, in the expansion of 
P{y,\ 1 a) in a power series in £, only even powers will occur, Termmating 
the Taylor expansion (in section 8 we prove its validity) at the fourth term, 
we have 

(4.1) P(-y,Xli) = P(7,XlO)+|^-^^^M 
where 0 < J < 5. 

The expected value of P( 7 ,X | x) (considering i as a random variable) is 
equal to P( 7 ,X). Since the expected value of is 1/N and the expected value 
of 

x*d*P 
4! dx* *-£ 

IS of the order 1/iV^ (this is proved in section 9), we obtain from (4.1) 


r£* 3V(7,X|x) 
li-o ' 4! dx* 




(4.2) 


P(.,X).P(x,XiO) + ^^5^ 


+ 




On the other hand, substituting l/\/N for x in (4.1) we obtain 

(4 3) f (XA 1 - P(7,X 10)+^^^ L + 4i^- r ' 

where 0 < f' < l/VF. Hence, since the second term of the right member 
of (4.3) is of the order 1/iV*, 


i-t' 


(4.4) 


P(7,X 


^)-W|0) + ^g 


... “(ap) 


From (4.2) and (4.4) it follows that 


(4.5) 


P(y 


,X) - p(y,^ 




Thus, this difference approaches zero rapidly as JV —> <». 


6. Computation of the value X for which P 




takes a pieassigned 


value ^ Denote by Xn .js that value for which P(xn > xl,?) = (9. This value can 
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be obtained from a table of the x* distribution. From (3.6) and (3.7) it follows 
that the required value X* of X is given by the root of the equation 



Thus, the desired value of X* is given by 

(5-2) ^ 


{vn' 

normal distribution.’ 


The value r 


is defined by (3.6) and can be obtained from a table of the 


( 6 . 1 ) 


6. Lower and upper limits for P{y,\) As mentioned in section 2, P(y,\) is 
equal to the expected value of P(y,X | x). Thus, 

P(7,X) = ^ r“p(y,X\i)e-^''*’dx. 

V2ir ■'-'o 

To obtain upper and lower limits for P{y,X), we shall construct upper and lower 
limits for the integral on the right-hand side of (6.1), It can easily be seen 
that P(y,X I i) is a strictly decreasmg function of x*. Hence, to obtain lower 
and upper limits for the mtegral in the right member of (6.1) we can proceed 
as follows; Choose a positive constant d and a positive integer k. Denote by 
ai the probability that fd < x < (f + l)d, (f = 0, 1 , • • •, fc— 1) , and let a* be the 

probability that x>hd. Then 2^ 1 fd) is an upper bound, and 2^ Ui-i 

1-0 

P{y,X I id) is a lower bound of the integral in question. Thus 


1-1 


( 6 . 2 ) 

and 

(6.3) 


P(7.X) > 2Eo<_iP(7,Xlid) 

1-1 

P(7.X) < 2i:a.P(T,Xlfd). 


The two limits can be brought arbitrarily close to each other by choosing d 
sufficiently small and k sufficiently large. A method of computing P{y,\ ] x) 
for any given value x has been described in section 3 and the quantities can 
be obtained from a table of the normal distribution. The amount of compu¬ 
tational work, however, increases rapidly with increasing fc. 

‘ The Statistical Research Group computed, under the supervision of Albert H. Bowker, 
a table of tolerance limit factors X (see formula 6.2) for (J » ,76, ,90, .96, .99; y •= .76, .90, 
.96, .99, 999, N = 2 (1) 102 (2) ISO ( 6 ) 300 (10) 400 (26) 760 (60) 1000, Mr. Bowker also 
developed an asymptotic formula for X (published elsewhere in this issue of the Annah) 
which, when < .99, 7 < 999, and N > 160, agrees with (5.2) to within 1 unit in the third 
signiheant figure The Applied Mathematics Panel plans to publish the table and a brief 
explanation of tolerance limits in the volume entitled Techniques of Slolisticul Analysis de¬ 
scribed in the footnote on page 217. 
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7. Approxunate determination of the tolerance limits. The exact tolerance 
limits are given by x — Xs and x + Xs where X is the root of the equation in X 

(7.1) P( 7 ,X) = j3. 

This equation has exactly one root in X, since P{y,\) is a strictly increasing 
function of X. Denote this root by X = X(d,T)" Thus, the exact tolerance 
limits are given by x — X(/3,7)s and x + X(|8,y)s. 

We have seen in section 4 that p(y,'K j closely approximates P(t,X), the 

difference being of the order 1/N^. Thus, a close approximation to X(d, 7 ) can 
be obtained by solving the equation in X, 

(7.2) P(7,Xl;^) = d. 

This equation has again exactly one root in X, since P ^ 7 ,X | ^^^^^is a strictly 

mcreasing function of X. Denote the root of equation (7.2) by X = 

Thus approximate tolerance limits are given by x—\*{p,y)s and x-l-X*(| 8 , 7 )s. 
In section 5 it has been shown that 


(7.3) X*03,7) a/ 

f Xn«l 3 

where n = iV — 1, Xn,fl is that number for which the probability that with n 
degrees of freedom exceeds this number is /3, and r is the root of the equation 

1 ri/V^+f , 

(7.4) / e = y. 

^ ^ V27r hty/h-T 

The number x»,p can be obtained from a table of the x^ distribution and r can be 
determined from a table of the normal distribution. 

Since X*(d, 7 ) is only an approximation to X(d, 7 ), P[7>X*(|9,7)] wiU differ slightly 
from /3. To judge the goodness of the approximation of X*(d, 7 ) to the exact 
value X(| 8 , 7 ), it is desirable to derive upper and lower limits for the difference 
P[ 7 ,X*(d> 7 )] — |3- Such limits can be obtained by computing upper and lower 
limits for P[y,\*il3,y)] using the method described in section 6 . 

We cite here a few numerical examples to show the goodness of the approxima¬ 
tion. 




Upper limit 
of P[y,\*(P,y)] 

Lower limit of 
P[t.X*(|3,7)] 

37,674 

.95202 

.96077 

4.550 

.98989 

.98908 

2.631 

.95161 

.94393 

2.972 

.99024 

.98813 
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8. Validity of the Taylor expansion of P(y,\ | x). Weehall show that P(t,X |^) 
has derivatives of all orders at every point x, y and X being fixed. This is 
sufficient to validate the Taylor expansion used in section 4, 

For typographical convenience write 

rii,y) = R. 

We have 

VTrL^ "• 

Differentiating (8.1) with respect to x we obtain 


whence 

(8.3) ~ = tanh xR. 

dx 

Now the analytic function tanh z of the complex variable z has only purely imagi¬ 
nary singularities. Hence R possesses derivatives of all orders for all real values 
of £, 

Now 

P(y,\ I i) = P (^8 > = 1 “ fc t”~' r”'’"®*’’ dt 

where A is a constant. Hence from (8.3) 

(8.4) ~ = -ftp"'* ^janhxP . 


The right member of (8.4) is a product of functions which are analytic in the 
entire (complex) R plane by a function which possesses derivatives of all orders 
for every real x. Since R possesses a derivative (with respect to x) for all real 
X, it follows that P possesses derivatives of aU orders for every real $. 


9. Proof that 


E 


X* P 
.4! d£^ 



Since P is a minimum at x — 0 it follows that P( 7 ,X | £) has a maximum there. 
Hence, from (4.1), the quantity 



4! dx^ 
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is never positive. Therefore 

d^P 

dx^ 




< 

~ dx^ 


2^ 


is bounded above for | i 1 > 3, where 5 > 0 is arbi- 


d* P 

Consequently — 

oS* |i™{ 

trarily small. Since P possesses everywhere derivatives of all orders, the fourth 
derivative is continuous and hence bounded above for | it | < 5, From this we 
3* P 

obtain that -r-r is bounded above for every real x. 
ox* 

Since P{y,\ | x) is always positive we have, from (4.1), that 


tl 

dx* 


> - 




12 2 ?+ 5 




dx^ 


fnO / 


i-i 


For I «1 greater than a sufficiently large number C, the left member of the 

3* P 

above inequality is thus bounded below. For | i | < C we have that — 

uX 

3* P d* P 

is bounded below because — is continuous. Hence — 

3r 3r 

low for every real x, 
a- 3*P 


is bounded be- 


i-t 




is bounded above and below for every real x, the desired 


result follows. 
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APPROXIMATE FORMULAS FOR THE PERCENTAGE POINTS 
AND NORMALIZATION OF t AND x® ‘ 

Bv Henry Goldberg* and Harriet Levine 

StaUstical Research Growp, Columbia Universily 

1. Introduction. The x® Distribution and Student’s t-diatribution are func¬ 
tions of a parameter (degrees of freedom) and approach the normal distribu¬ 
tion as n approaches infinity. The normal distribution is a good approxima¬ 
tion to these distributions for large n. For small or moderate n, a better 
approximation may be obtained by using a function of i(or x) which approaches 
the normal distribution more rapidly as n increases. Hotelling and Frankel 
[7] pointed out that an additional advantage of the normalization of a distribu¬ 
tion IS that further statistical tests are possible with the normalized variate. 
Normalizing f(or x*) is equivalent to transforming it into a function which is 
normally distributed to a required degree of approximation; that is, a normally 
distributed variate of zero mean and unit variance is expressed as a function of 
i(or x) in powers of X/n. 

The reverse problem of expressing f(or x) as a function of a normally dis¬ 
tributed variate of zero mean and unit variance in powers of 1/n is also of prac¬ 
tical importance in connection with significance tests for which the significance 
levels, or percentage points, of the t and x distz'ibutions are required. 

Cornish and Fisher [1] (see also [2]) have given a metliod for the normalization 
of distributions which approach normality as the number of degrees of freedom, 
n, increases and whoso cumulants are expressed in power series of l/n, so that 
the order of magnitude of the rth cumulant is that of A method has 

also been given for expressing a variate with such a distribution as a function 
of a normally distributed variate of zero mean and unit variance in powers of 
l/n. 

It is the purpose of this note to apply the Comish-Fisher method (1) to the 
derivation of asymptotic formulas for the percentage points of the t and x* dis¬ 
tributions and (2) to the normalization of these distributions. Tables are 
given which inchcate the accuracy of these approximations and compare them 
with other approximations. Tables are also given to facilitate the calculation 
of the approximations for the percentage points of ( and x®‘ 


* This paper reports work done in the Statistical Resenroh Group, Division of War Re¬ 
search, Columbia University under ooniract OEMsr-BlS with the Applied Mathematics 
Panel, National Defense Research Comniittoc, Office of Scientiflo Research and Develop¬ 
ment. The work was first reported in an unpublished memorandum, ^'Application of the 
Cornish-Fisher method to an approximation of the significance levels of I and x’" (SRG 
number S07, April 28, 1945) 

‘ Henry Goldberg died April 19,1945. 
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2. The Comish-Fisher method.’ Consider the random variable y with 
probability distribution function f(y), expected value E(y), and variance <r“(j/). 
Let Kt denote the rth cumulant of y and Or denote the rth relative cumulant of 


K 

y; i.e , Or = . Let x denote a normally distributed variate with zero mean 

K-2 

and unit variance. 

For every p, (0 < p < 1), let Pp be defined by 


and Xj, by 



P 



■\/ 27r 


dr 


P- 


That is, corresponding to every Pp , there is an Xp having the same probability 
mtogral (p). The Cornish-Fisher Method for expressing a normally distributed 
variate with zero mean and unit variance as a function of a standardized variate 
with the same probability integral gives 


(1) a:, i)o + biZp + 6i2p -|- bjZp -(- 642^ + 6s2p -!-••• 


where z, is the standardized variate corresponding to p^; i.e., 

<riy) 

and the hi are defined in terms of the relative cumulants. 

Cornish and Fisher give also the following expansion for a standardized vari¬ 
ate as a function of a normally distributed vanate: 

(2) Zp ‘—' Co "t" C\Xp -t" CjSp -j- CjXp -j- CtXp CoXp • 


where the c, are defined in terms of the relative cumulants. 


3. An approximation for the percentage points of Student’s f-distiibution. 

The standardized variate z = t can be expressed as a function of the 


normal variate, x, in powers of 1/n by using the Comish-Pisher equation (2). 
Omitting terms of degree greater than two in 1/n gives, after simplification, the 
following asymptotic expansion for t: 


(3) 


I X -\- 


3^' + a; , 
in 


5x* + Ifir’ + 3x 
96n* 


’ Churchill Eisenhart suggested the use of the Cornish-Fisher Method for obtaining per¬ 
centage points of the chi-square distribution not given in existing tables, a problem which 
arose in several connections, including the computation of a table of factors for tolerance 
limits for normal distributions according to two formulas devised in the Statistical Re¬ 
search Group, one by A. Wald and J Wolfowitz and the other by Albert H Bowker, both of 
which are published elsewhere in this issue of the Annals of Math. Stat. The table will be 
included in a volume by the Statistical Research Group, Techniques of Slalislical Analysis, 
to be published by the McGraw-Hill Book Company in 1946; its preparation, including the 
work reported in the present paper, was directed by Albert H Bowker; the Statistical Re¬ 
search Group was directed by W. Allen Wallis. 
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For simplicity, the subscript p which appears in the Cornish-Fisher equation 
(2) has ^en dropped. It should be understood, however, that the x and i used 
in expansion (3) have the same probability integral. It is interestmg to note 
that the first two terms were derived by Peiser [4]. 

TABLE 1 


Table of Polynomials Required for the Approximation for the Percentage Points 

of the t-dislribution* 


Probability Integral 
(p) 


1 

Mx) 

Mx) 

.999 

3.090232 

8.150129 

19.692529 

.9975 

2.807034 

6.231221 

12.850916 

.995 

2.575829 

4.916548 

8,834762 

.99 

2.326348 

3.729074 

5.719746 

975 

1.959964 

2.372271 

2 822499 

.95 

1.644854 

1.523769 

1.420203 

.90 

1.281552 

,846585 

.570891 

.75 

.674490 

.245336 

.079490 


*This table can be used for determining x, fi(x) and /»(») corresponding to 
the complements of the selected values of p by using the relations 


p — Xjt 

fl(~x) = -fi{x) 
M-x) = -/ 2 (x). 


To facilitate the use of the approximation, tables of the required polynomials 
in X have been computed for selected probability integrals. The approxima¬ 
tion can be written 

■■■ 

n 

where 

m = 

and 

,. , 6a:' + 16a:' 3a: 

96 

Ta,ble 1 gives values of Xp (or x), fi(x) and /a(.r) for selected values of the prob¬ 
ability integral p Table 2 gives approximate and exact percentage points of t 
for selected values of p and degrees of freedom. The exact values were taken 
from Merrmgton [5]. Table 2 shows the high degree of accuracy of the three 







TABLE 2 

Comparative Table of Approximate and Exact Values of the Percentage Pointy 

of the t-distribution 


Probability 
Integral (p) 

Degrees of 
Freedom 

Approximate Percentage Point 

N ormal 

2 Term 

3 Term 

.9976 

1 

2 8070 

9.0383 

21 8892 


2 


5.9226 

9.1354 


10 


3.4302 

3.5587 


20 


3,1186 

3.1507 


40 


2.9628 

2.9708 


60 


2.9109 

2.9145 


120 


2.8590 

2.8599 

.9950 

1 

2.6758 

7 4924 

16.3271 


2 


5 0341 

7.2428 


10 


3.0675 

3 1558 


20 


2 8217 

2.8437 


40 


2.6987 

2.7043 


60 


2.6578 

2.6602 


120 


2.6168 

2.6174 

.27 BO 

1 

1.9600 

4.3322 

7.1547 


2 


3.1461 

3 8517 


10 


2.1972 

2.2254 


20 


2.0786 

2.0856 


40 


2.0193 

2.0210 


60 


1.9995 

2.0003 


120 


1.9797 

1.9799 

.9500 

1 

1.6449 

3.1686 

4 5888 


2 


2.4067 

2.7618 


10 


1.7972 

1.8114 


20 


1.7210 

1.7246 


40 


1.6829 

1.6838 


60 


1.6702 

1.6706 


120 


1.6576 

1.6577 

.7500 

1 

0.6745 

.9198 

.9993- 


2 


.7972 

.8170 


10 


.6990 

.6998 

, 

20 


.6868 

.6870 


40 


.6806 

.6807 


60 


.6786 

.6786 


120 


.6765 

.6765 


Exact Por- 
centage Point 


127.32 

14.089 

3.6814 

3.1534 

2.9712 

2.9146 

2.8699 

63.657 

9.9248 

3.1093 

2.8453 

2.7045 

2.6603 

2.0174 

12.706 

4.3027 

2.2281 

2.0860 

2.0211 

2.0003 

1.9799 

6.3138 

2.9200 

1.8125 

1.72-17 

i.em 

1.6707 

1.6S7? 

1.0000 

.8166 

.6998 

.6870 

.6807 

.6786 

.6766 
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term approximation for w > 10 and the superiority of this approximation over 
the two-term approximation derived by Peiser. 

4. An approximation for the percentage points of the distribution. The 

standardized variate z = —can be expressed as a function of the normal 

V 2n 

variate, x, in powers of 1/n hyuising the Cornish-Fisher equation (2). Retain- 


TABLE 3 

Table of Polynomials Required for the Approximation for the Percentage Points 

of the distribution* 


G,(x) 


Ot(.x) 


0^ix) 


0,(x) 


Oiix) 


Probability 
Integreal (p) 


.999 

9975 

.995 

.99 

.976 

.95 

,90 

.75 


4.370248 

3.969745 

3.642773 

3.289953 

2.771808 

2.326174 

1.812388 

.953873 


6.699690 
4.586292 
3.756598 
2.941263 
1.894306 
1.137029 
.428250 
- 363376 


.619006 

.193953 

-.073888 

-.290266 

-.486382 

-.554981 

-.539450 

-.346842 


-1.602112 
-1.113149 
-.802518 
-.541971 
-.272398 
-.122957 
-.017722 
.060220 


1.273498 
.875184 
.622768 
.411597 
.194832 
.077898 
.002186 
-.030881 


* This table can be used for determinmg the Ot{x) for values of a: correspond¬ 
ing to the complements of the selected values of p by using the relations 

" Xp 

I 0<{-x) = i-iyOiix), fort = 1, ... , 5. 


ing terms in gives, after simplification, the following asymptotic expansion 
for x’: 

2 , ^ , V » . ^ . V . G}ix) . Gi{x) , Ot{x) . 

(4) X n + Giix)n' + Qi{x) 4- —j- + —— + -j- + 

n n n 

where 

= V23; 

Giix) = - 1 ) 


(6x* -f 14i* - 32) 

= 486^ + 256rt’ - 433 t). 
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Comish-Fister. 67.327G 70.0649 77.9295 82.358190.1332109.141 118.498 124.342 135.807 140.184 

Peiser. 67.336370 070877.9308 82.3583 90.1326109.141 118.498 124.343 135.812 140.192 

Wilson-Hilferty. 67.303270.0494 77.9294 82.361890.1378109.137 118.493 124.340 135.820 140.193 

Fisher. 66.4809 69.388877.6493 82.2427 90.2126109.242 118.400 124.056 135.023 139.154 
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PERCENTAGE POINTS OP t AND x' 

As before, the subscript p which appears in the Oomisb-Fis^ier equation (2) has 
been dropped. The x and % which are used in expansion (4) have the same 
probability integral. The first four terms were derived by Peiser t^]- 
Table 3 gives values of the G^{x) for selected values of the probability in¬ 
tegral p. Table 4 compares various approximations with the exact percentage 

TABLE 5 

Comparaiine Table of Approximate and Exact Values of tho Probability Integral oft 


Probability Integral of / 


t 

n a= 1 

71 =3 2 

n — 

m 

n = 

20 


Approxi¬ 

mate 

Exact 

Approxi¬ 

mate 

Exact 

Approxi¬ 

mate 

Exact 

Approxi¬ 

mate 

Exact 

0.1 

.5311 

.5317 

.5351 


.5388 

.5388 

.5393 

.5393 

1 

.7734 


.7917 


.8296 

.8296 

.8354 

.8354 

3 

1.0000 



.9523 

.9954 

.9933 

.9967 

.9965 

5 

1.0000 

.9372 


.9811 

1.0000 

.9997 


1 0000 

6 

1.0000 

.9474 


.9867 


.9999 


1.0000 


TABLE 6 

Comparative Table of Approximate and Exact Values of the Probability Integral of 


Probability Integral of x* 


x’ 

n =» 2 

n = 

10 

■M 

20 

n = 

29 


Approxi¬ 

mate 

Exact 

Approxi¬ 

mate 

Exact 

Approxi¬ 

mate 

Exact 

Approxi¬ 

mate 

Exact 

1 

.3963 

.3935 

.0010 

.0002 

.0000 



.0000 

5 

.9646 

.9179 

.1098 

.1088 

.0004 



.0000 

10 

1.0000 

.9933 

.5594 

.5595 

.0323 



.0004 

20 

1.0000 


.9768 

.9707 

.5420 

.5421 


. 1071 

30 

50 

1.0000 



.9991 

.9305 

■ 

5H6U 

.9916 

.5860 

.9910 


points of for selected values of p and degrees of freedom. The Peiser four- 
term approximation, the Wilson-Hilferty approximation, 



and the Fisher approximation, 

Xp = + V2n - 1)“ 
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are given for comparison. The exact values were taken from Thompson [6]. 
Table 4 shows the high degree of accuracy, and the general superiority of the 
Cornish-Fisher approximation, for n > 10. For low probabilities (.005) the 
Peiser approximation is often better than the full series, for small n, (1, 2), the 
Wilson-Hilferty approximation is often better. 

6. Normalization of t and The Cornish-Fisher equation (1) applied 
to the i-distribution or, alternatively, a formal reversion of the power series 
(3) gives the asymptotic expansion 

(5) a: ~ tTl - ^ + • ■ • ' . 

L 4n 96n* 

Expansion (5) agrees with the first three terms of an expansion derived by Ho¬ 
telling and Frankel [7]. 

Applying the Cornish-Fisher equation (1) to the distribution gives the 
expansion 

* ~ 38880 vs.' + “““O 

(6) - ^ [53553x^ + 2208x= - 386] + ^ [34257/ + 792x* + 238x1 

- i [25221/ -f 304x1 

71 71 I 

6. Accuracy of the normalizations of t and x^ The accuracy of the normaliza¬ 
tion (5) of i may be judged from Table 5, which compares the approximate value 
of the probability integral with the exact value. The approximate value is the 
normal probability integral corresponding to the value of x computed from (5) 
for the given values of i and n. The exact values ivere obtained from Student’s 
tables [8], For fixed n, the approximation improves as t decreases from mod¬ 
erate to small values. The approximation appears to improve as t increases 
from moderate values (about 3) to large values because of the more rapid ap¬ 
proach to unity of the probability integral of a normal variate. 

The accuracy of the normalization (6) of x* may be judged from Table 6, 
which compares the approximate value of the probability Integra] with the exact 
, value. The approximate value is the normal probability integral corresponding 
to the value of x computed from (6) for the given values of x* and n. The exact 
values were obtained from the table of Pearson [9]. 
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THE EFFECT ON A DISTRIBUTION FUNCTION OF SMALL CHANGES 
IN THE POPULATION FUNCTION 

By Burton H. Camp 
Wesleyan University 

1 , Summary. It is generally assumed in the application of distribution 
theory that, if the actual population function is not very different from the one 
used m the theory, then the true sampling distribution of a statistic will not be 
very different from the one obtained in the theory. But elsewhere in mathe¬ 
matics we do not assert that a conclusion will be only slightly modified by a small 
deviation in the hypothesis. This paper presents some theorems which are 
useful in determining the maximum effect on a sampling distribution of certam 
kinds of small changes in the population function In particular, if the popula¬ 
tion is denoted by the function <j>{t), if a sample of n independent measurements 
(h, ■ • ■ , U) is taken from this population, if a statistic x = g{k , • * ■ , in) is 
formed from the sample, and if I){x) denotes the distribution of this statistic; 
then, when <f) (t) is changed by a small proportionate amount to D{x) will 
be changed to Di(.r), and the relation between D and Dj will be subject to the 
inequality: 

11 ' 

where 

« = (1 1| and I — 11 < 5. 

2. It IS generally assumed m the application of distribution theory that, if 
the actual population function is not very different from the one used in the 
theory, then the true sampling distribution of a statistic will be not very different 
from the one obtamed in the theory. For example, we commonly apply to 
practical problems the distribution theory that has been obtained on the hy¬ 
pothesis that the population is normally distributed even though we know that 
our actual populations are only approximately normal in form, and we commonly 
assume that our results are approximately correct. But elsewhere in mathe¬ 
matics we do not assert that a conclusion will be only slightly modified if we only 
slightly modify the hypothesis. An example of our unwillingness to do this 
in other branches of mathematics is illustrated in the following example. 

Example 1. Let y = (/>(<) have the derivative y' ~ 0'(<). Let be re¬ 
placed by where <l>i — <t> = and | s{l) | < e, e being small. Wo 

have thus chosen to make (^i — small relative to <t> rather than small abso¬ 
lutely so that this example may be useful in another connection. The derivative 
of 01 may of course differ very greatly from as for example in some of the 
approximations made by a few terms of a Fourier series; and it would be a major 
error to assume that the two derivatives are approximately equal. How can we 
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(D - Di)dx 


g t f D(x)dx, 
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be sure that, in the process of finding a distribution function, we are not mairin g 
an error of the same^ sort? 

The following theorems partly answer this question. The theorems will first 
be stated and proved in great generality. Then we shall return to the functions 
in Example 1 as a special case. We shall be concerned with a sample consisting 
of a single observation of n measurements (fi , • ■ ■ , tn) drawn from the multi¬ 
variate universe , ■ • • , 4.), or, more briefly, with the vector T as a sample 
from the n-way universe Throughout this paper and sball be func¬ 

tions which are non-negative and whose integrals over the entire spaces of their 
definition are unity. Let the statistics {xi, • ■ ■ , im), or more briefly the vector 
X, be constructed from T thus: 

(1) xi = gi{T), ,x„ = gm{T). 

If now p represents any ineasurable point set in Z space and if dX is used for 
(dxi • ■ • dxm) and dT for (dti ■ • • dip), a fundamental theorem [1] of distribution 
theory asserts that, if q is the point set in T space for which Z is in p, then the 
distribution D(X) is determined by the equation. 


( 2 ) 


f D{X) dX = f \l/(T)dT, if these integrals exist. 

J n Jq 


Theorem 1. Using the foregoing notation, let yj/{T) he replaced by \f'i(T) and 
let ^i(r) — \p(T) = \f/{T)S(,T), where | S | < «, and as a consequence letD(X) be 
replaced by Di(Z); then 


(3) 


J Di(X)dX - j D{X)dX < t J DiX)dX < 


To prove these inequalities we merely need to notice that the point set g 
depends on the g’s but not on the imiverse, and that therefore we may use the 
same p and g as in (2) in the follo^ving equation which determines Di : 


(4) [ A(Z)dZ = f MT)dT. 

J p Jq 

Subtracting (2) from (4) we obtain 

(6) I f DidX - f DdX = I f (L>i -I>)dX =1 fih- 4')dT 

\ Jp Jp \ Jp I 1 *^3 

= I f ^SdT < « I f = t I f 

Jfl I »/<i •'P 


DdX 


< «. 


^ The general question being raised here has been approached heretofore from differ - 
ent points of view. In particular, other exact population functions besides the normal 
have been studied, and in some cases the distribution theory has not been greatly dis¬ 
turbed as a result Also, the eflfeots of slight changes in the parameters of a population 
function have been studied. 
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since vt* is never negative, and the integral of D is never greater than unity. It 
should be noticed that the final inequality of (5) is independent of the g’s, al¬ 
though this is not true of the preceding inequalities, n'hich do depend on the 
g’s because they involve p and q. 

Corollary. In particular^ let \p = <^(<0 ■ • • where tj)(t) defines a one¬ 
way universe function, and h, ■ ■ ■ , are independent samples from it. Let x = 
g{k, , Q. Then, if (i){L) is replaced by (^nd if g>i — 4> <= 8{t)<l)it), and if 
I s{t) I < S, and if D{x) is the distribution of x before the replacement, and Dfx) 
is the corresponding distribution after the replacement, 

where 

€ = (1 + 2 )" — 1, and —00 < a < b < 

This corollary follows from the theorem because of the universe, 

'f'(h , • • ■ , tn) = <t>(li) • • • d>(ln), 

and 



I ‘' I In) — tt>(.li) • • • + s(h)] •••[!-(' 5(^n)], 

so that, in the notation of the theorem, 

UT) = HT) + 4'{T)SiT), 

where 

S{T) = [s{ti) -h • • • -f- 8 (t„)] + (a(ti) 5 (fe) 4- • • • + s(i„-i)s(<„)] 

+ • • • -f [s(0 • • • s(<i,)]' 

Hence 


8 


nS 


nl 


21(ti - 2)! 


s' + 


S“ 


(1 + S)" - 1 = e. 


The interval (a, b) now replaces the point set p of the theorem. 

This theorem and its corollary are powerful in that they may be applied to all 
statistics, but they are weak because of the restrictions on 8(T) and s(t). It is 
to be noted also that the corollary is ineffective when n is large, a difficulty which 
seems to the author to be implicit in the sampling process. The restrictions on 
s(t) make it impracticable to apply the corollary to the following exai^le since, 
as will be observed, if 1 1 1 > c, ^ = - 4 ,, and so then I s I = 1: and when 
5 = 1 ,« = 2 " - 1 . 

Jxample S. Let <^(0 = (2 t)-^V‘‘'^ in (- co, «), and let MO = A(2ir)-^'^ 
e m ( c, c) and let MO = 0 if j < | > c, where c is not infinite and A is so 
chosGii th&t thfi integral of ov 6 r (— oo ^ oo ^ i 3 unity* 

This type of example is important because, in the attempt to apply the theory 
of normal distributions to practical matters, the first discrepancy that appears 


•One could as well use •. 
its importance. 


but we choose 


the simpler case on account of 
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is that in the theory the given distribution is infinite in extent while in practice 
it is finite. The following theorem generalizes the preceding one so as to permit 
it to apply to this example. 

Theokem 2, Let all of T-apace be divisible into two parts, Qq and Qi, satisfying 
the following conditions. In Qo let - <p(T) = S(T)^iT), and let | S{T) \ < 
E. In Qi let — 0, and let 

[ ^{T)d{T) < ei. 

Then 


f Di dX — f DdX < E f DdX + ej < e -j- 

‘'P ‘'D 


It is not required that Qo or Qi be the totality of points for which its attendant 
conditions are true. 


Proof. As before, if the integrals exist, 



and 




Hence 

f DidX - f DdX = f (rPi ~ 4>)dT = f {^^l - v^)dP + ( ih- f)dT, 

Jp Jp Jg Jqg Jj, 

where qo ie that part of q which is in Qo, and qi is that part of q which is in Qi. 


( 6 ) 

(7) 


1 [ DidX - f DdX 

< 

f (i^i - mT 

+ 1 f (lAi - il)dT 

1 •'P 


1 •'so 

1 *'«! 

f - V')dP = f mT 

1 < f f ypdT 

i Jqt 



( 8 ) 



< £ 

f ypdT = e 

1 DdX, because ^ > Q, 


fq 

Jp 

1 (^1 - ^f)dT 

- 1 

^ f xjid’T ^ Cl j 

Jqi 


because = 0 in gi, The inequalities (7) and (8), when substituted in (6), 
prove tt^theorem. 

Corollary. In particular, let and tc be defined as in the corollary to Theorem 
1 , and let 4>iit) be so defined that, | i 1 < c, 4>i{t) — ~ where as before 

I a(0 I ^ and Ei=(14-i)'' — 1; and, iS\t\ > c, let 0i(Y) = 0. Also let 

/ <i>(i) ■ • ■ <j>{ln)dT < El where Qi is the set where | ] > c for at least one value 

■'oi 

of i. Then 


f Dt{x)dx — f D{x)dx ^ « f D{x)dx +«!<« + 

Jq Ja 


provided these integrals exist. 
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Proof. This corollary is implied in the theorem if we let ^(T) = 

• • • <l>(tn) and ^i(f) = <^i(h) • ■ • and then let Qo be the point set in 
T-space where | <. | < c for all values of i, and Qi be the point set where | ^( ( > c 
for at least one value of i. As in the corollary to Theorem 1, p becomes the 
interval (a, h). 

Example S. Let (p and be as in Example 2, and choose c = 3. Then A = 
1/0.9973 = 1.0027, and 


[ <p(k]-“'l>(in)dT = 1 - (.9973)”. 

•'Ol 

This quantity may be taken as «i. Also 

I (^1 - 0 )/<^ I = IA - 1 i = 0.0027. 

This quantity may be taken as S. Then « = (1.0027)" — 1. Hence 

I ^h i*b 

/ Di(x)dx — j D(x) dx < e I D(x)dx + ei . 

Ja Ja 


If n is not large, an approximate value for both « and ei is O.OOSn. This quantity 
is not particularly small unless n is small, but it could not be expected to be 
very small since the corollary pertains to all statistics of the form x = 

ff(li I • ■' I (*). 

Example4. In one of the author’s earlier papers [2] he found the distribution 
of the geometric mean, x = (ti • • of n observations chosen from the 
universe described by the so-called curve of equal facility, whose equation is 


y = 


1 


i/ap 


The author stated that there was about as good justification for assuming that 
the distribution of statures was given by that universe as for assuming that it 
was normal. After one more theorem we shall now be able to state that, if one 
wishes to cling to the assumption that the distribution of statures is normal, then 
the distribution of the geometric mean is close to the distribution found in that 
earlier paper. We do need another theorem for this because we should be deal¬ 
ing with two distributions, and </>((), which do not obey the requirements of 
the corollary of Theorem 1 , because they approach zero at different rates as i 
becomes infinite, and do not obey the requirements of the corollary of Theorem 
2 because neither vanishes throughout the infinite intervals for which | (j > c. 
But the following theorem and corollary will take care of this and of similar 
cases. It will be observed that Theorem 3 includes Theorem 2 as a special case. 

Theorem 3. Using the foregoing notation, let all of T-space be divisible into 
two parts Qo and Qi satisfying the following conditions. In Qo let fiiT) — ^(T) = 
S{T)\l>iT), and let | S(T) | < «. LetT = Qo+ Qi and 

f UT)dT -f f 4,(T)dT < . 

•'Oi Jqi 
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Then 


[ Di(X)dX - f D(X)dX 


— ^ f ^(X}dX "H £i ^ e 6 i. 


Proof. As before, 


[ DidX - I DdX = [ (^1 - ^)dT = f (^1 - y(;)dT + f - ^tddT 

•'j ■'p •'« Jjo hi 

1 / BidX - f Ddx\ < I [ ^|,)dT\ + I f (i^i - i^)dT\ = 1 + 11. 

I •'p ‘'P I I •'flo II hi I 

I < e f DdX<i. 

h 

11< [ ^idT+ f ipdT< f ^idT+f iPdTKei. 

Jii hi •'Qi hi 

These inequalities together prove the theorem. 

Corollary. In particular, let ij/, <f>i , and x he as in the corollary of Theorem 2, 
except that now, instead of requiring ^i{t) to vanish when | ^ | > ewe shall let Qi 
and €i he so chosen that 


[ ^i(h) • • ’ <hiitn)dT + f <l>iti) • • ’ (t>{t„)dT < ej. 

hi Jqi 


Then 


I f Di{x)dx — f D{x)dx < f f D{x)dz + ei < 
I Ja Ja 


+ Cl- 


As before stated, the inequalities of this paper apply to all statistics for which 
the integrals involved exist. It seems probable that closer inequalities could be 
devised by placing appropriate restrictions on the g functions which define 
these statistics. 
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AN EXPERIMENTAL DESIGN FOR SLOPE-RATIO ASSAYS 
By C. I. Bliss 

Connecticut Agricultural Experiment Station and Yale University 

1. Summary. WLen the response to a drug is a linear function of arithmetic 
dosage units, the relative potency of two preparations can be computed as a 
slope-ratio assay. Their dosage-response curves are computed by solving three 
simultaneous equations to obtain the common intercept a\ the slope of the stand¬ 
ard, 6i, and the slope of the unknown, 6a • The method is applicable to certain 
microbiolo^cal assays for the vitamins. Usually several unknowns are assayed 
at one time with a single standard. Their calculation is simplified when such 
assays meet the following requirements: (1) restriction of treatments to the zone 
within which the response is related linearly to the dose, (2) equal spacing of 
doses on an arithmetic scale beginning with the negative control, (3) an equal 
number (fc) of doses of standard and of each unknown and (4) r replicates for 
each dose of unknown, h' replicates for the negative control and h replicates for 
each dose of the standard. 

2. Method of Analysis. The design and analysis of assays for measuring drug 
potency has been developed largely about the linear relation between response 
and the logarithm of the dose of many drugs. An alternative procedure is 
available when some measure of the response is related linearly to arithmetic 
dosage units, Recently Finney [6] has applied the technique to microbiological 
assays of the vitamins. The relationship is also suitable for experiments with 
toxic agents on micro-organisms, where the length of exposure to treatment is 
the dose. Since potency is measured from the ratio of the slope of the dosage- 
response curve for an unknown to that for the standard preparation, Wood [6] 
has termed the method a “slope-ratio assay.” 

The validity of quantitative biological assays depends upon a qualitative 
similarity between the standard and the active agent of the unknown. When 
the response is related linearly to the log-dose, this is determined by testing the' 
parallelism of the lines fitted separately to the results for the standard and to 
those for the unknown preparation. If the departure from parallelism is ivithin 
the sampling error, the combined slope is determined from the data on both 
preparations and used in computing potency and its error. The analogous test 
in slope-ratio assays is the convergence of the lines relating response to arith¬ 
metic dose at zero content of drug, using drug as a generic term which includes 
vitamins, poisons and physical agents. When the curves for the standard and 
the unknown are computed separately, their zero intercept should agree within 
the experimental error. In. assays meeting this requirement, the curves are 
computed so that they are forced to intersect at zero dose. The curves 


yi — a’ + hxi 
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and 

Vi — of bjiCj 

are fitted by solving three simultaneous equations to obtain the three statistics, 
a', bi and which are the best estimates of their respective parameters. Finney 
[6] has illustrated the technique with data from the microbiological assay of 
nicotinic acid and given a suitable test for convergence as well as the error of the 
estimated potency. 

The calculation described by Finney is flexible but not adapted for routine 
use. With certain restrictions in design, the calculation can be reduced to a 
practicable form for the assay of (m — 1) unknowns against a standard prepara¬ 
tion. These restrictions are as follows: 

1. Doses both of standard and of unknoivns must fall withm the range for 
which some function of the response is related linearly to an arithmetic scale of 
dosage units ivith convergence at zero dose. 

2. Within this range the doses (x) of standard and of all the unknowns must 
be spaced similarly and preferably equally on an arithmetic scale, begmning 
with the negative control (a: = 0). 

3. The doses of each unknown must match those of the standard in respect 
to both number (k) and their expected potencies, so far as the latter can be 
judged in advance. Within an assay group there may be h' replicates of the 
negative control, h replicates of each dose of the standard and r replicates of each 
dose of each unknown. 

4. Some element of randomization must be introduced within an assay group 
in respect to the preparation of the tubes, their handling and the reading of the 
results. Replicates of any given dose or of the negative control must not be 
prepared together. 


3. Computational Procedure. The simplified calculation of potency and its 
error depends upon substituting the assumed for the actual doses. When 
spaced equally on an arithmetic scale, they may be coded by using the numbers 
1, 2, 3, ■■■ k, k being equal throughout the assay. The sums of the coded 
doses, Si., and of their squares, Si, are then the same for each preparation and 
may be entered in the equations for computmg the inverse matrix, of which the 
first three are 


( 1 ) 


Nctu -|- hSicii + rSiCu 
hSiCQt hSicu 
rSiCoi + rSiCi, 


1 = 0 i = I i = 2 

1 , 0 , 0 , • ■ ■ 

0 , 1 , 0 , • •. 

0 , 0 , 1 , . •. 


where the total number of observations is JV = h’ + kh -t kr(m — 1). 
plying the last two rows by — Si/Si and adding the products, wO have 


1.T hSi , .V rtSi 1 . 

W - - (m - 1) ^co. = 1, 




Multi 
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■where the subscript i refers to the standard and the assay includes 2 to » unkno-wn 
preparations. Substituting 

D = NS2- hS\ - rim - , 

this leads to the folio-wing reciprocal coefficients: 


Coo — Si/D 

Coi “ C|0 * " Sl/Df 2 ‘ 

Cii = l/hSi Si/DSi 

Ci, = l/r ^2 “H Si/DSi , 1 = 2, 3, • * ■ m, and 

df = Sl/DSi for i, j = 1,2, “■ m, where i j. 

The reciprocal coefficients are computed from the sums of the doses and their 
squares, -which are the same for all preparations. The doses are multiplied by 
the responses observed at each dosage level to obtain T, = Sixyi) for any given 
preparation. For the standard there -will be h responses at each dose and for 
each unknown r responses. Let T = SiT,) be the sum of these products over all 
m preparations. The total response for all N observations Siy), including the 
negative control, the standard, and all the unkno-wns, is designated as T „. 

Using normal regression theory, the common intercept is computed as 

a' = CoalT* + CmT. 

Substituting the above reciprocal coeficients, 

(2) a' = (SiTy - SiT)/D. 

The slope of the standard is computed with the reciprocal coefficients as 
hi = cciTy + Ciiri + CuT — CuTi, 

We may take advantage of the identities 


Si 

Cm = — Coo 
02 


and 


Si 

Cti ~ — ^ Cot 
02 


to obtain 


hi = (cii - Cu)Ti - |la' 
02 


reducing to 

(3) 


hSi Si 


Similarly the slope of each unknown is equal to 

h = Co,Ty + CuTi + CiiT, + c„T - c„{ri + T,} 
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where i, j 
(4) 


2, 3, • - m and 39 ^ 1 . Since ci, - c.y = 0, this may be reduced to 
, _ T, a^Si . ^ „ 


The computation is further simplified if the k doses of all preparations are 
spaced not only similarly on an arithmetic scale but also at equal intervals. 
In this case 


= fc(fc + l)/2 and Ss = k{k + l)(2fc + l)/6. 

Substituting in equations (2), (3) and (4), the common intercept, the slope of 
the standard and that of each unknown may be computed as 

. / ^ 2 (2fc + i)r,-^6r 

N(k - 1) + ih'ik + 1) 

'■‘-2-rVi{iTO-4 

In computing the slope for each unknown in an assay the only variable is T,. 
The intercepts and the slope can be checked by substitution in the equation 

(8) 2Ncl' "j~ hk{k -|- l)&i -(- Tk{k -f* 1)(&2 “b * ’ * “b ^m) ~ 2Ty , 

In terms of coded doses, the potency of an unknown (f) relative to that of the 
standard ( 1 ) is computed as 


(9) 




6 . 

W 


Each J' is converted to original units by multiplying it by the ratio of the dosage 
intervals, /,//«, the potency being 


( 10 ) 


= 

W„' 


The variance measuring the distribution of the observations about the m 
lines may be determined as 


( 11 ) 


2 _ S{]t) - a'Ty - biTi-- 

* N — m — I 


The variation about the individual linos is assumed not to vary from one prepa¬ 
ration to another. This is more likely to be true when the assumed potencies 
differ but little from those computed from the assay, so that J' differs relatively 
little from unity. 

The confidence liiruts for potency as estimated from the ratio of the slopes 
may be computed from Fieller’s basic formula [4], For confidence limits, Xl , 
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at an appropriate level of significance, such as P = 0.05, t is read from the Stu¬ 
dent-distribution for iV — m — 1 degrees of freedom and entered with s* from 
equation ( 11 ) in the equation 

(12) Xlihi - cnsh^) - 2Xi(bib. - Ci.sY) -f (b“. - cusY) < 0, 

where i indicates one of the 2 tom unknown preparations. When solved for 0 , 
the limits may be written 

(13) „ bib, - Cus^t ^ 

„ „ 2,2 

wl ” CllS i 


± at V(cii — ci.)b* 4- (c.t — cn)b? + cujbi — b<)^ — (ciicv< — 

b? — 


where Cu — cu = l/hSi, — cu = l/rSjand cuc„ — — ~ , 

rhDSt 

In all critical cases, the exact limits should be computed. 

In most slope-ratio assays the individual slopes differ very significantly from 
zero. Under these circumstances the approximate limits may be computed 
with reasonable accuracy from the variance of the estimated potency by the 
familiar formula for the variance of a ratio [ 1 ]. 


(14) 


V{J') 


(cjti , _ 2 cm \ 

bl lb? b? b,b,/ 


= ^ ((cii — Ci<)b? -H (cti 


cu)bl + cu(h - h? 


The discrepancies between the approximate and the exact limits are evident 
from a comparison of equations (13) and (14). When the doses are spaced at 
equal arithmetic intervals, equation (14) can be reduced to the more convenient 
form 


. 2 . _ _ 65 “_ j h + rJ'^ ^ 3(1 - J'f \ 

" b?(2h -f 1) \rhfc(h -h 1) N{k - 1 ) -j- Zh\k -fl)/ • 

A major limitation to slope-ratio assays is the frequent curvature in the rela¬ 
tion between response and arithmetic dosage units. For this reason it is advis¬ 
able to use routinely four or more doses of each preparation. Occasionally an 
assay in which there is curvature at the highest dosage level may be salvaged by 
computing the potencies from the data of the smaller doses. The agreement of a 
given assay with the postulate upon which it is based may be tested objectively 
by an analysis of variance, segregating the sums of squares (a) for the agreement 
of the negative control with the intercept, (b) for the agreement of the individual 
curves at the intercept, (c) for agreenient of the observations with straight lines 
fitted individually and (d) for the variation among the h replicates of the stand¬ 
ard, the h' replicates of the negative control and the r replicates of the unknowns. 
The calculation of such an analysis is greatly facilitated by the recommended 
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design. Since it follows the usual pattern, it will not be described here. The 
procedure has been tested \^'ith the data from an experiment on the depth dose 
of x-rays [2] and has been applied to microbiological assays 13] in papers where 
the reader will find the technique exemplified. 
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NOTES 

This section is devoted to brief research and expository articles, notes on 
methodology and other short items. 

COMPUTATION OF FACTORS FOR TOLERANCE LIMITS ON ^ NOR¬ 
MAL DISTRIBUTION WHEN THE SAMPLE IS LARGE‘ 

By Albert H. Bower 
Columbia University 

In their paper [1], Wald and Wolfowitz discuss the problem of finding tolerance 
limits of the form j rfc Xs for a normal distribution. They propose the following 
large sample formula for X which appears to be satisfactory for all practical 
purposes for iV ^ 21 



where N is the number of observations (n = W — 1 ), t is the tolerance coeffi¬ 
cient, /3 is the confidence coefficient, r is defined by 

/rr I _ e dt <== y 

and xl has the property that P(x > xa) *= 0 forn degrees of freedom. To compute 
X, tables [2] or known approximations [3] for xl are customarily used, but the 
computation of r, even for large N, is tedious, involving an iterative procedure. 
The purpose of this note is to obtain an expansion of r in terms of 1 /•\/N and to 
combine this expansion with a known one for xa to obtain an assrraptotic formula 
for X. 

To derive a large sample formula for r, consider the function 

where for convenience and r are replaced by x and y. It is desired to express 
y as a power series in a:. Let j/o be defined by /(0,yc) =’ 0. Since J{x,v) is a con- 

* This paper reports work done in the Statistical Research Group, Division of War Re- 
search, Columbia University, under Contract OEM8r-018 with the Applied Mathematics 
Panel, National Defense Research Committee, Office of Scientific Rosearoh and Develop¬ 
ment. The work was first reported in an unpublished memorandum, “Computation of 
Factors for Tolerance Limits when the Sample is Large" (SRC No. 669, September 24, 
1945) A brief account of the application of tolerance lintiits, including tables, will be 
published in Techniques of SlatisUcal Analysis described in the footnote on page 217. 
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TABLE 1 


Comparative Values of Exact and Approximate A 




50 

100 

llSO 

\ 

7 

Exact 

Approx- 

unate 

DiL 

fa- 

eace 

Exact 

Appiox' 

unate 

Dif¬ 

fer¬ 

ence 

Exact 

Approx¬ 

imate 

Dif¬ 

fer¬ 

ence 

.76 

.75 


1.26147 

.00333 


1.21698 




.00063 


.96 

2 13774 

2.13226 

.00648 

2.07533 





.00089 


.999 

3.68821 

3.57979 

.00842 


3.48112 



3.43563 

.00141 

.96 

.76 

1.39621 

1.38467 



1 30670 




.00182 


96 

2 37866 

2.36921 

BiBSM 

2.23279 

2 22635 


2.16728 

2.16420 

.00308 


.999 

3.99259 


.03179 

3.74835 

3.73776 

01059 

3.63860 

3.63341 

.00609 

.99 

.76 

1 51184 

1.48901 

.02283 

1.38261 

1.37611 

00740 


1.32216 

00361 


.96 

2,67665 

2,63698 

.03867 

2 35546 

2.34290 

.01256 

2.25865 

2 25268 

.00697 


999 

4 32326 

4.25926 

06399 


3 93343 

02086 

3.79189 

3,78196 

.00993 


Comparative Values of Exact and Approximate Continued 


X 

B 

500 

800 

1000 

Exact 

Approx¬ 

imate 

Dif¬ 

fer¬ 

ence 

Exact 

Approx¬ 

imate 

DU- 

ter- 

ence 

Exact 

Approx¬ 

imate 

Dif¬ 

fer¬ 

ence 

.76 

.76 

1.17733 

1.17724 


1.17126 

1,17122 


1,16891 

1.16888 



.96 



00016 


1.99552 


1 99158 

1.99163 



,999 

3.36769 

3.36744 

.00026 



lyy 

3.34361 

3 34362 


.95 

.76 


1.21470 



1.20047 


1.19602 

1.19491 

,00011 


.96 

2.07013 




2.04536 



2.03689 

.00019 


.999 

3 47647 

3.47469 


3.43433 

3.43390 


3 41831 


.00031 

.99 

.76 

1.24268 

1.24208 


1.22198 

1.22169 


1.21395 

1.21374 

00021 


.96 

2.11727 

2 11626 



2 08152 




.00035 


.999 

3,66462 

3.66292 


3 49643 

3.49460 


3.47244 

3.47186 

00068 


rif 

tinuous function of x and y, and since 

dy 


»-*0 

v-yo 


^ 0, the function y(x) defined 


(iiU St i « 

implicitly by (2) is continuous. Since ^ ~ ^ ^ higher deriva- 

dy 

tives of y{x) exist and are continuous and y{x) permits of a finite Taylor’s ex¬ 
pansion. The coefficients of odd powers of x drop out and we obtain. 

I 2/0 2 1 3i/o 2j/o _4 I 

y = yo+ + —4i— ® 


2 ! 
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Oil returning to the original notation and retaining terms in 1 /iV^, 


(3) 



1 r*!- 

If Xj, is defined by £ 


e = p we know from [3] that 


(4) 


xa ^.\/2xi^ 2xi_a - 1 

n 3 n 


Proceeding formally and retaining terms in 1/N we obtain 



Xi-fi . 4 + 6xi_a\ 

12A^ / 


and multiplying by the expression for r given by equation (3) we find the desired 
expansion for X. 


(5) 


X Tot 


(‘ 


. 5xi-ff + lO'X 

V2N i2iv' y • 


Recall that both and xi-g are readily obtainable from tables of the normal 
curve, in fact, r«, is defined by 


j e~‘‘'^dt = 7 and xi-fi is defined by = 1 — fi. 

A comparative table of approximate and exact values of X is given in Table 1 • 
From the table we see that for N ^ 800 the error is less than 1 in the 4th sig¬ 
nificant figure, and for N ^ 160 the error is less than 1 in the 3rd significant 
figure within the limits of /3 and 7 considered. The approximation will be less 
exact for higher values of /3 and 7 . 
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THE PROBABILITY DISTRIBUTION OF THE MEASURE 
OF A RANDOM LINEAR SET 

By David F. Votaw, Jr. 

Naval Ordnance Laboratory 

1 . Mtroductioa. (Consider a random sample 0n(a;i, • • ■, !„) of n values of a 
one-dimensional random variable x having cumulative distribution function 
F{x), Let there be associated with each x an interval of length D centered at x 
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{D a positive constant). Let S(0*) denote the random set which is the point-set 
sum of the n intervals associated with On ; S(0n) is a set of one or more intervals. 
Let S denote the measure of S(0„) (S is the sum of lengths of the intervals 
composing SCOJ). Given F, n and D, what is the probability function of S? 
This note contains a solution of the problem for Fix) = x, {0 < x < 1)-, the case 

of F(x) = / Ne d(, (0 < X < CO; /f > 0), is also treated. 

Jo 


2. Sampling from a uniform distribution. Let y = S - D. The range of 
y is 0 < y < OT, where m denotes the minimum of 1 and (n - 1 )D. Let xt, 
■ • In be the sample values arranged in increasing order of magnitude. Make 
the transformation 


( 2 . 1 ) 


yo = xi 

y, = x,+i ~ Xi, (j = _ 1). 

ft—1 

can be expressed as X) MVi, D), where miy,, D) denotes the minimum of 
1-1 




ft—1 / 

, and D. The probability function of (yo, yi, • • •, y„_i) is 71 ! n dVu , (yu > 0; 

u-0 \ 

n-1 \ 

2 yu < 1 )• If m = (ti — 1)D, then y ^ {n — 1)D if and only if y< > D, (t = 1, 

d-O / 

• • •, n - 1) i for a fixed yo it can be shown by use of the Dirichlet integral that 
the volume of the (n — 1) dimensional region in which any point (yo, yi, • • *, 

y„_i) satisfies this condition is follows that: 


( 2 . 2 ) 


Pr 


fy s= (n — 1 )Z)} ~n U — yo - (n — 1 ) 2 )]" ‘ dyo 


JVimO 

“ [1 — (n — 1)2)]", 


((n - 1 )D < 1 ). 


The probability that Y < y < Y + AY (where Y < m and AY denotes an 
arbitrarily small positive increment in Y) can be evaluated by determining 
volumes of certain regions contained in the tetrahedron defined by j/u > 0 , 

n-1 

UVu < 1 . Consider the following conditions: 

u-O 


(a) qD<Y<(q+ 1)2) 


(b) y.^D 


iq « 0 , 1 , • • •, iif; Af denotes the minimum 
of (n ~ 2 ) and the greatest integer less 

»• 

(u « 1, < 3 ), 


than 


(c) 


ti.< 


1 




y + i 2 ), 


(d) y, < D 


(f = ;■ + 1 , — 1 ). 
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The probability that 7 < y < F + (b), (c) and (d) are satisfied is; 

fT+ir dy 

(2^) A,In, V>) iV _ J_-J. 

where Aj{y, yo) denotes the j dimensional volume of the region in which any 
point (yi, • ■ Vi) satisfies (b) and (c), and Bj{y) denotes the {n ~ j - 2 ) 

dimensional volume of intersection of the hyporplane 'Ei Vv - y ~ jD with an 

in ~ j - 1 ) dimensional cube (0 < y„ < D). It is clear that if any other of 

the T combmations of j y’s out of the set of (n — 1 ) y’s had been specified 

in (b) and the (n ~ j — 1 ) complementary y's had been specified in (d), the 
corresponding A^iy, j/o) and Bjiy) would be equal to those given in (2.3); hence 

Pr {7 < y < r + AF} = nl i: T B,(y) 

(2.4) . f 

Jv 

qD < Y < (q + 1)A Y < m, (q = 0,1, 

^i(y, yo) = , and (see ( 1 ] and [ 2 ]) 

(2-5) B.W - IS ~ r > - + r)!-« 

From (2.4) and ( 2 . 6 ) it follows that the probability function of y, say/n(y), is: 

^)(i + ’•)]' 


a-r dv 


1 ’ 


( 2 . 6 ) 


1 n-y-2 


5 ^* < y < (g + 1)D, (q = 0, > • ■ , M), y <m. 

U{y) is not defined at {n — 1)D if (n ~ 1 )D < 1 (see (2.2)); if m = 1, the range 
of definition of /„(y) as given in ( 2 . 6 ) is y < 1 . 

The cumulative distribution function of y is continuous with the exception, 
in the case of (n - 1 )Z) < 1 , of a saltus of amount [1 — (n — 1 ) 1 )]" at y = 
(n - 1 )D (see (2.2)). The probabDity function /„(y) is continuous over the 
range 0 < y < m with the exception, in the case of n > 3 and (n — 2)D < 1, 
of a simple discontmuity at y = (n — 2 )D. 

For n = 2 and D < 1, 


My) = 2(1 ~ y), 


(0 < y < D), 
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andPrli/ = K} = (1 - D)\ 

Por n = 3 and 2D < 1, 

Uy) = 6(1 “ 

m = 6(1 - y)v - 12(1 - y){v - D) + 6(1 - yf 

and Pr (y = 2D) = (1 - 2D)’. 

The expected value, say E{y), of y is: 


(2.7) 


» - (1 - -O)"! 

_ («- 1 ) 

(n + 1) 


(0 < 2/ < D), 
iD<y< 2D), 


(D < 1 ); 


(D > 1). 


The expected value of >S is D + E(y). E(y) can be derived by use of (2.6) 
or by use of a theorem of Robbins [3]. 

3. Probability that random linear set covers range of variate. Given that 
F{x) = s, (0 < a: < 1), and tiD > 1, what is the probability, say JPd , that 
S(0n) contains the interval (0 < a; < 1)? If D < 1, the interval is covered 
if and only if (i), (ii) and (iii) below are all satisfied: 


(i) 

(ii) 

(iii) 


2/u < D, 

n-I / 

S ^ (l 

u-l \ 

. D 

yo < 2 - 


yt 


-?)■ 


(u = 1, •••,« - 1), 


„Pd can be expressed as follows: 

pDIt O-VO 

(3.1) - n! j i 




dz 


dyo, 


where Cn_i(3) (see [2]) denotes the (n — 2) dimensional volume of the intersection 

n—I 

of the hyperplane X) l/u — ^ with an (n — 1) cube 0 < 2 /u < D. It follows from 

U-.V 

(2.5) and (3.1) that 

- E (- 1 )“ ( )(1 - 

(3.2) -2 £ (-«-(%ll-'^-2) 

+ '“g" (-!)■ (“; ^)(i - 

where D < 1 and [a:] denotes the greatest integer less than x. If 1 < D < 2, 
/ D\" 
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4. Sampling from F{x) = f He~^‘ dt, {0 < x <, to ■, H > 0). If F(x) = 

Jo 

I the probability function of S can be determined but is very cumber- 

Jo 

some in the form in which it is known to the writer. The characteristic function, 
say ffW, of the probability function of S will be given instead. By use of (2.1) 
it can be shown that; 


(4.1) 






t9 ~ \H 


where i = 

The expected value, E(S), and variance, vs, of S are: 


(4.2) 


EiS) = D + E 

// X-1 


1 Ft 

1 (1 — e ) 


<ra 


> _ 1 V ® 


X 

--20BX 


) _ 2D 

H \ 
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INFORMATION GIVEN BY ODD MOMENTS 

Bt Edmund Chubchilu 
Eutgers University 

The widespread use of the third moment about the mean as a measure of skew¬ 
ness and the belief engendered by this use that a distribution is symmetric if its 
third moment is zero prompt the question of how much information about a 
distribution can be deduced from a knowledge of its odd momenta. An answer 
Let F{x], a cumvlaiive distribution function; (n = 1, 

2, ‘' •), a sequence of real numbers,' and « > 0 be arbitrary. There exist^ a c.d.f,, 
F (a), having as odd moments the terms of the given sequence and such that 

(1) I Ft^x) - F*(*) 1 < *, o« x. 

If the mean of F(,x) is equal to ps and the variance of F{x) is not zero, it can be 
shown that F*(x) may be chosen so that in addition the variance of F*(x) is 
equal to that of F{x). 

An immediate consequence of our statement is that a distribution need not be 
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symmetric even though all its odd moments vanish. Such an asymmetric distri¬ 
bution, due to Stieltjes, is given by: 

(2) dF{x) = 1/48 (1 - fc sin (ij*) di, - co <a;< = -lUx <0, 

k = 1 a X > 0. 

The proof of our statement will follow easily from the following: 

Lemma. Let i, a scgucnce of real numbers be given. There exists a c.d.f. 
having as odd moments the given numhe/s. 

We construct a sequence {^f„l of increasing step-functions in such a manner 
that for every n, the first n moments of H„ are the first of the given numbers, 
and such that this sequence converges to a monotone function having all the 
desired moments. A slight modification of this function will give the desired 
c.d.f. 

Let Ih be identically zero. We form Bi by adding to Hi a jump or mass of ^ 
at X = 2mi. In general, Ih is formed from ff*_i by adding to it fc masses chosen 
so that their first (fc - 1) odd moments are zero and so that the fcth odd moment 
of Hy is mifc-i. This we do by adding tlie masses | x, 1, (j’ = 1,2, • • • , fc), at the 
points e,jp where the x/a are the solutions of: 

pxi + 2pxi -!-.••+ kpxy = 0 

p’xi 4- (2p)’xi 4--b (fcp)’xit = 0 


p^’xv -f (2p)’*“*x, + ... 4- (fcp)’‘"‘xv = 0 

P^'xi 4 (2p)“"'xj 4"-h (fc?)**"*** = iThk-i - m{Hy-.f), 

m(Hy~j) is the fcth odd moment of Hy^i , Ci is the sign of x/ and p is a parameter. 
Since the determinant of this system is a Vandermonde determinant, there exists 
a unique set of solutions for every non-zero value of the parameter. The masses 
thus chosen clearly have the sjoecified moments. Eliminating p from the left 
sides of the equations by division, it is apparent that the X/’s are all linear func¬ 
tions of p"'’*"”. Thus we may choose p so large that the sum of the masses 
added at this step does not exceed 1/2*. The absolute odd moments of orders less 
than (2k - 1) of these fc masaes are also linear functions of negative powers of p. 
We may thus insure by further increasing our choice of p that the (2fc - 1 - 2r)th 
absolute moment of Ily does not exceed the corresponding moment of Hy-i by 
more than l/2^ For definiteness, we choose p as the smallest number satisfying 
these requirements. 

The first of these restrictions on p insures that for each value of x, the sequence 
ff„(x) is increasing and bounded from above by one. The sequence of functions 
thus converges to a monotone function H*(x) with the property that H*(~ «j) 
= 0, H*( to) < 1, The other restrictions on p insure that the sequences of abso- 
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lute odd moments of all orders are uniformly bounded, a bound for the abso¬ 
lute moments of order 2fc - 1 bemg one greater than the absolute moment of 
this order of Hk. This in turn insures that the odd moments of H*(x) exist and 
that they have the desired values. By adding a jump of 1 — H*i ca ) at the 
origin we obtain H(x), a c.d.f with the given odd moments, 

The main statement of this note is an immediate consequence of the lemma. 
Let the ifcth odd moment of Fix) be Afat-i, which we assume to be finite, and let 
the sequence [nhk-i] be defined by the relationships; 

/ujfc-j = (1 ~ + ftthk-i , (k = 1,2, ■ • • 

Let H(x) have the m’a as odd moments. The c.d.f. F*[x) defined by 
F*(x) = (1 - t)F(x} + eHix) 

clearly has the properties stated above, and our statement is proved. If the 
momenta of Fix) are not all finite, the proof will need only minor modifications. 

If one asks m addition that F* have a finite range, F* wiU, in general, not 
exist. If, for example, the range of F is finite and its odd moments are zero, 
then F must be symmetric about the origin, for F* defined by dF*ix) ~ dFi—x) 
would have the same moments as F. But a c.d.f. with finite range is determined 
by its moments; hence Fix) - F*ix). 


SOME ORDER STATISTIC DISTRIBUTIONS FOR SAMPLES 
OF SIZE FOUR 

Br John E. Walsh 
Princeton University 

1. Summary. Let a:i , J;j , *5 , represent the values of a sample of size four 
drawn from a normal population. There is no loss of generality in assuming 
that the distribution function of this population has zero mean and unit vari¬ 
ance. Denote it by N (0,1). Let »(,•) be the ith largest of xi,Xi,X 3 ,Xi. The 
purpose of this note is to determine the joint distribution of 

+ ^(3) - X(5) - X(i) , a!(4) - X(3j + x(j) - X(i) , and — a;(8) + X(i ), 

and derive from this joint distribution the joint distributions of these statistics 
taken in pairs, also the distribution of each statistic itself. 

2. Analysis. Consider the joint distribution of 

ri = K®* + xj - xj — xi) 
rz = - xj + 12 - xi) 

’■3 = — X3 — Xi + Zi). 
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Evidently, 

Ein) = 0, a = 1, 2, 3). Einr,) = 0, (i 9^ j). E(r^ = 1. 

Hence the u are independently distributed according to JV(0, 1). 

Let Vj be the jth largest of 1 |, j rj |, j ra |. Then by first finding the joint 

distribution of | ri |, | rj |, | ra | and then applying the distribution for order sta¬ 
tistics [1], it is easUy seen that the joint distribution element of «i, va, va is 

^^fivi)fivi)Jivi)dvidv!dva , 

where 

1 I a 

f(y) = ® ^ J 0 < < Da < % . 

Examination shows, however, that 

Ua = + 35(8) — a:(a) — a:(i)) 

Da = K35(4) — 3:(8) x^i) — X(x)) 

Vx = \ \ a!(4) — *(3) — 35(2) + X(l) 1 
Let 


Wla = X{i) + X(t) — X(i) — X(i) 

mi = X(,t) - *( 3 ) + XiD - Xn) 

mi = 31(4) — *(8) — X(i) + a;(i). 

Then the joint distribution element of \mi 1, wia and nta is 

6/(11 mi \)f{\m^S{hm^d \ mi | dm^ma . 

Since the function / is symmetrical about the origin, it follows immediately that 
the joint distribution element of mi, ?n 2 and mz is 

3f{^mx)f(^Tn2)f{^3)dmidnhdrnz , 

where | mi | < nh < mz. 


3. Derived results. By taking marginal distributions it is found that the 
joint distribution elements of mi, ma and mz taken in pairs are 


gi(.mi, mi)dm: 


,1 dmi — 3 (^f 


My)dy) Mmi)f(imi)dmi dmi. 


mi 


^a(mi, m 3 )dmi dmz = 3 ^f(iy)dyj f{^mi)f(^mz)dmi dmz 


gz(mi , mz)dmi dmz 


-(f- 


f{iy)dy)f(hmi)f{^mz)dini dmz. 


J 
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The distribution elements of mi , Vk and mi are seen to be 
ffiimOdmi = -^ I fiWvj Mmi)dmi. 

gi{7th)dmi = 6^jf fWyjfilmijdmi 


/•mj 


j/3(ma)dms = 3 f(k)dyj Mmsjdms 
It is to be noted that if a > 0, 


Pr(0 < mi < a) ~ Pr{-a < Wi < 0) = ^ - 4 > 


/ fan \J / fWt \S 

Pr(0 < mi < a) = 12 U f{y)dyj " 16 (| fWyj , 

j fan y 

Pr(0 < mj < a) = 8( j fiy)dy] , 


so that the probability that any of mj, ^ 2 , mi lie between two given numbers 
is expressed explicitly and can be calculated with the aid of standard tables for 
the normal distribution. 


4. Generalization of method. The method used to obtain the joint distribu¬ 
tion of the order statistics wii, m* and Wj was to take all possible combinations of 
4 variables with two plus and two minus signs (except for factor of -1) and 
show that these combinations behave as normally distributed independent 
variables. The question arises as to whether this method of finding order sta¬ 
tistic distributions would apply in general to 2n variables with n plus and n 
minus signs. It is easily proved that this will occur only when n = 2. 

REFERENCES 

[1] S. S, Wilks, Malhmattcal Statistics, Princeton Univ. Press, 1943, p. 90. 



NEWS AND NOTICES 

Readers are invited to submit to the Secretary 0/ the Institute news items of interest 

Institute of Statistics of the University of North Carolina 

Announcement of detailed plans for the North Carolina All-University Insti¬ 
tute of Statistics has been made by Professor Gertrude M. Cox, Director of the 
Institute. 

To provide graduate-level trainmg for students in statistics and to combine 
the theoretical or mathematical statistics with applied or experimental statistics, 
a Graduate Department of Mathematical Statistics is being set up at Chapel 
Hill with Professor Harold Hotelling as Head. The existing Department of 
Experimental-Statistics at Raleigh is a part of the Institute, and will be headed 
by Professor Gertrude M. Cox with Professor W. G. Cochran as Director of 
Research. Professors Hotelling and Cochran v'ill be Associate Directors of the 
Institute. 

Professor Hotelling, ivho will head the Department at Chapel Hill comes to 
North Carolina from Columbia University, where he has been directing its 
graduate mathematical statistics program. Previously, he had held positions 
with the University of Washington, Princeton University and Stanford Uni¬ 
versity. His undergraduate training was taken at the University of Washington 
where he majored in journalism; his Master of Science degree was awarded by 
the same institution in mathematics; and his doctorate by Princeton University, 
also in mathematics. In addition, he has done some graduate work at the Uni¬ 
versity of Chicago. Professor Hotelling’s publications in mathematical statistics 
are numerous and well known. Among the members of his staff mil be a visiting 
professor, M. S. Bartlett, on leave of absence from Cambridge University. A 
graduate of Cambridge and native of England, Bartlett has also held positions 
with the University of London and the Imperial Chemical Industries, and during 
the war was engaged in war research in London. 

In addition, P. L Hsu, William Madow, and Herbert Robbins, will be mem¬ 
bers of the Department at Chapel HiU as associate professors. Hsu, a native 
of China, has held teaching positions with the University of Peking and the Uni¬ 
versity of London. He received his degrees from Tsinghua University and 
the University of London. 

Madow is now in Brazil, where he is serving as a visiting professor of statistics 
at the University of Sao Paulo. He received his training, both undergraduate 
and graduate, from Columbia University, and has worked with the Departro,ent 
of Agriculture Graduate School and the Bureau of the Census in Washington. 

Robbins will come to the University of North Carolina from New York Uni¬ 
versity where he has been serving as an assistant professor. Prior to that, 
he was a staff member of the postgraduate school of the U S. Naval Academy, 
and an instructor in mathematics at New York University and at Harvard 
University, He holds A.B., A.M. and Ph.D. degrees from Harvard University. 
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The appointment of Edward Paulson iis an instructor completes the initial De¬ 
partment staff at Chapel Hill. A graduate of Brooklyn College and holder of an 
M.A. degree from Colmnbia Hnivernty, Paulson has been more recently study- 
ing mathematical statistics at Columbia under a pre-doctoral fellowship of the 
National Research Council. 

Professor Cochran came to North Carolina in March from Ames, Iowa, 
where he had been serving as professor in the statistical laboratory of Iowa 
State College. During the war years he was sent to England, Germany, and 
Austria on special work for the War Department, after spending a year at 
Princeton University where he served as research statistician on war work. A 
native of Glasgow, Scotland, Cochran has been in the United States since 1939, 
and is a naturalized citizen. Before coming to America, he was employed as 
statistician with the Robhamsted Experimental Station in England. Cochran’s 
publications in both the theory of statistics and applied statistics are well 
known, as is his experience with practical research problems. lie is serving this 
year as president of the Institute of Mathematical Statistics. He is a fellow 
of the American Statistical Association and a fellow of the Royal Statistical 
Society of England. 

Under the plans of the Institute, students who arc preparing to teach statis¬ 
tics or to develop statistical theory will take most of their training at Chapel 
Hill. However, work between the two branches will be so coordinated as to 
include instruction in the application of statistics os taught in Raleigh. 

For students who intend to become statistical consultants in various other 
fields, basic training will be taken in mathematical statistics, with the main part 
of the advanced applied training at Raleigh. 

For research students, on both campuses, who arc working in other sciences, 
iucluding agriculture, biology, medicine, psychology, sociology, economics, in¬ 
dustry, and textiles, training in both basic and applied statistics will be given. 

Working with Cochran in Raleigh are Professor J. A. Rigney; Associate 
Professors R, L. Anderson, J M. Clarkson, H. L. Lucas, and Paul Peach; 
Assistant Professor H. F. Robinson; Instructors Margaret Fleming, R. T. Monroe 
and Sarah Porter 

Collaborators working with the Raleigh unit are A. L. Finkncr, W. A. Hen¬ 
dricks and F. E. MeVay of the Bureau of Agricultural Economics; C. E. La- 
moiireaux and G. P. Weber of the Weather Bureau; and D. D. Mason of the 
Bureau of Plant Industry. 


Joint Session of the Institute and Section A of the AAAS 

A joint session of the Institute of Mathematical Statistics and Section A of 
the American Association for the Advancement of Science was held in the 
Municipal Auditorium at St. Louis on Saturday, March 30, 1946 at 2:00 P M. 
At this session invited addresses were given by Lieutenant Commander John H. 
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Curtiss on Statistical Inference and its Engineering Applications, and by Mr. 
Morris H. Hansen on Some Sampling Problems in Surveys of Business and 
Population. 


Personal Items 

Dr. Paul H. Anderson is at present Economic Analyst with the War Assets 
Corporation at Washington. He is also teaching mathematics in the evening 
school of American University. 

Assistant Professor T. A. Bancroft has returned from a teaching position 
at the University Study Center at Florence, Italy, to his position at Iowa State 
College. 

Associate Dean Walter Bartky of the University of Chicago has been appointed 
Dean of the Division of Physical Sciences. 

Mr. Gordon L. Beckstead in working toward his doctorate in statistics at the 
University of California. 

Mr. Donald Cody has returned to his position as Assistant Actuary at the 
Equitable Life Assurance Society after spending three years in war research 
with the NDRC, the Naval Ordnance Station at Indianapolis, and the Naval 
Ordnance Station at Inyokern, California. 

Professor Allen T. Craig, after war service at the Postgraduate School of the 
U. S. Naval Academy at Annapolis, has returned to his position at the University 
of Iowa. 

Mr. James H. Davidson is studying for his doctorate in chemistry at Princeton 
University. 

Associate Professor J. L. Doob of the University of Illinois has been promoted 
to a professorship. 

Assistant Professor Churchill Eisenhart of the University of Wisconsin has 
been promoted to an associate professorship. 

Dr. Wayne Gutzman recently discharged from the Navy as Lieutenant, has 
assumed his new duties as Assistant Professor of Mathematics at the Postgradu- 
ate School, Naval Academy, Annapolis, Maryland. 

Mr. Bernard Hecht has been discharged from the Army and is now Chief 
Quality Control Engineer with the International Resistance Company at Phila¬ 
delphia. 

Dr. D. G. Humm has been elected president of the Southern California Acad¬ 
emy of Criminology. 

Mr. Amrom H. Katz is in charge of a group of physicists, engineers, and aerial 
photographers representing the Aerial Photographic Laboratory at Wright 
field, which will record photographically various aspects of the forthcoming 
atomic bomb test at Bikini Island. 

Mr. Edward A. Lew has ben released from active duty and has returned to 
his former position as Assistant Actuary of the Metropolitan Life Insurance 
Company. 
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Dr, E. V. Lewie is Junior Research Associate with E. I. duPont de Nemours 
at the Nylon Research Laboratory at Wilmington, 

Associate Professor M. C. MacPhail of Acadia University, Wolfville, Nova 
Scotia, has been promoted to a professorship. 

Mr. C. J, Maloney has been appointed to an instroctorship in the department 
of mathematics at low'a State College, 

Dr. Edward B. Olds is director of the Research Ritreim of the Social Planning 
Council of St. Louis and St. Louis County. 

Dr. A. M. Peiser has been appointed head of the Statistics Research Group 
at the Langley Field Laboratory of the National .\dvisory Committee for Aero¬ 
nautics. 

Mr. Robert J. Saunders has been released from the Army and is now connected 
with Mohawk Carper Mills at Amsterdam N. Y. 

Mr. Benjamin Stauber is now Chief of the Relocation Planning Division, War 
Relocation Authority. He has transferred from the Department of Agriculture 
for this work. 

Mr, Arthur I. Sternhell returned from the Army to his position as general 
staff assistant in the Field Management Division of the Metropolitan Life 
Insurance Company. 

Mr. Harry Weingarten has been appointed Tutor of Mathematics at the Col¬ 
lege of the City of New York, 

Assistant Professor J. R, Vatnsdal has finished his army service and has 
returned to the State College of Washington where he was promoted to an asso¬ 
ciate professorship. 

Mr. Bertram Yood has completed his duty in the navy and is now at Yale 
Station, Connecticut, 

A symposium on mathematical statistics and probability was hold at the 
University of California at Berkeley, January 28-30, 1946. 


New Members 

The following persons have been elected to membership tn the Institute; 

AlcMan, Prof, Armen A., PhD. (Stanford) Univ of Oregon, Capt. (A.C.) JJq. AAF 
Training Command, Ft Worth, Texas 

Bingham, M.D. 1920 S St., N. W„ Washington, D. O. 

Cannon, Edward W., Ph.D. (Johns Hopkins) Comdr., US Navy, Researoh and Standards 
Branch of Bureau of Ships, Cannon, Delaware 

Carvalho, Prof. Pedro Egydlo, Ph.D (8So Paulo) Univ, do Silo Paulo, Faouladade do Hi- 
giene, Avenida Dr, Arnaldo SB, Gaisa postal PB-B, Sao Paulo, Brazil 

Delsa, Alexis, A. I. Lg. (Liege) Mgr. Basle Bessemer Steelworks, Socidtfi Anonyme John 
Cookerill, Seraing, Belgium 

Duncan, David Beattie, B.SC, (Sydney) Graduate Student, Iowa State, Statistical Labora¬ 
tory, Ames, Iowa 

FroeUch, Kathryn, B.A, (Evansville) Statistician, US Dept, of Agriculture, Bureau of 
Human Nutrition and Home Economics, 1806 Monroe Si , iV. W., Washington 10, D. U. 
oldstlne, Herman H. Ph D, (Chicago) Institute for Advanced Study, Princeton, N J. 
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Hammond, Edward Cuyler, So D (Johns Hopkins) Major A.C , US AAF, Chief, Statistics 
of Plying Personnel Branch, Office of the Air Surgeon, JWO Conmchcui Ave , Washingion, 
D. C 

Hsu, Prof. Pao-Lu, Ph D, (London) Columbia University, 10^7 John Jay Hall, Columbia 
Umv , New York City 

Kyle, Garland Dean, M.S. (Michigan) Spectroanalyst, Physicist (US Navy)fi848 Filbert, 
Philadelphia 39, Penn. 

Lelbler, Richard A., Ph.D. (Illinois) Instructor, Purdue Univ , Math Dept , Lafayette, 
Indiana 

Lessard, Prof. Roger, C E. (Montreal) Hull Technical School, Hull, Quebec, Canada 

Moslmann, Thomas F., A.B. (Charleston) US Bur Labor Statistics, Regional Employment 
Analyst, Western Ave., Dallas 11, Texas 

Patte, W. Edmund, B.A.Sc. (Toronto) Stat, Eng , Canadian Industries Ltd , Shawinigan 
Falls, P.Q. Canada, 6 B0 — ISth St, Almaville 

Plza, Prof. Afionso P. de Toledo, Ph.D. (Sao Paulo) Eecola Politeohnica, Sfm Paulo, Brazil, 
Rua Ministro Godoy, 1133 

Rozen, Daniel I., A.B. (Columbia) Stat , Medical Statistics Div., Office of the Surgeon 
General, War Department, Rm 317-1, 341S S8lh St, N. W., Washington, D C 

Saldel, Frank, M.A. (Michigan State) Instructor in Math , Michigan State, East Lansing, 
Michigan 

Schmalz, William Herbert, B Sc.A. (Toronto) Technical Superintendent, Dominion Rub¬ 
ber Company Limited, Merchants Rubber Factory, 61 Breithaupt St., Kitchener, Ont. 

Stehn, John R., Ph D. (Wisconsin) Physicist, Research Division, Winchester Repeating 
Arms Co I New Haven, Conn. 

Tsao, Prof. Fel, Ph D. (Minnesota) National Central University, Chungking, China 

Weaver, Chalmers L., B S. (Kent State) Asst. Actuary, New England Mutual life Ins. 
Co , 601 Boylston St , Boston, Mass. 

Weber, C. Jerome (New York) Personal Trust Officer, The Chase National Bank of the 
City of New York, 11 Broad Street, New York City, Chappaqua, New York, Box S3 

Whitney, Donald Ransom, M.A. (Princeton) Grad. Asst , Math Dept , Ohio State Univ , 
Columbus, Ohio 

Wright, C. Ashley, M.A, (Princeton) Econ. Stat , Standard Oil Company, N. J., Box Si, 
RFD 6, Alexandria, Vo. 

Yost, EarlK., Jr., B.S (Washingtonand Jefferson) Grad. Asst., Math., Univ ofOklahoma, 
84 s College Ave , Norman, Okla. ' 



REPORT ON THE APRIL MEETING OF THE WASHINGTON 
CHAPTER OF THE INSTITUTE 

A meeting of the Washington Chapter of the Institute of Mathematical 
Statistics was held at George Washmgton University, Washington, D. C, 
on Friday and Saturday, April 12 and 13,1940, in conjunction with a meeting 
of the Washington Chapter of the American Statistical Association. 

More than 100 people attended the meetings including the following 61 mem¬ 
bers of the Institute: 

Theodore W, Anderson, Jr., Richard 0. Been, Archie Blake, David Blackwell, J, B, Bod- 
die, Glenn W. Brier, William Cohen, Jerome Cornfield, John H. Curtiss, Bessie B, Day, 
Robert Dorfman, Thomas I. Edwards, Andrew Fraser, Meyer A. Girshick, Clyde H. Graves, 
Margaret J. Hagood, Major Edward C. Hammond, Morris H. Hansen, Alston 8. Householder, 
Leonid Hurwicz, Irwin E. Jackson, Jr., Walter Jacobs, Hyman B. ICaitz, H. S. Konj'i, Lila F. 
Knudsen, Colonel S. Kullback, R. B. Ladd, H. G. Landen, Walter I^eighton, Getson Levin, 
Jacob E. Lieberman, Sophie Marcuse, Etlielyno L. McBee, William J. McCabe, Francis 
McIntyre, Dorothy Morrow, H. W. Norton, W. R. Pabst, Carl J. Rees, David Rosenblatt, 
M. Sandomire, Edward M. Scbrock, L, W, Shaw, John H. Smith, Frederick F. Stephan, 
F. M, Wadley, A, Wald, F. M, Weida, Samuel Weiss, B. S. Wilks, G. P. Young. 

The session Friday evening was devoted to the follo\ving contributed papers; 

1. Ealmahon of the Parameters of a Single Stochastic Difference Equation in a Complete 
System. 

T, W. Anderson and H. Rubin, Cowles Commission for Economic Research 
M. A, Girshick, Bureau of Agricultural Economics 
Presented by T. W. Anderson 

2. Estimation of Linear Functions of Cell Proportions. 

J H. Smith, Bureau of Labor Statistics 

3. On Functions of Sequences of Independent Chance Vectors with Applicalions to ike 
Random Walk Problem in k dimensions. 

D. Blackwell, Howard University 

M. A. Girshick, Bureau of Agricultural Economics 

Presented by D, Blackwell 

4. The Exact Power Curve and Dislnbulton of n for the Sequential Binomial Probability 
Ratio Test, 

M. A. Girshick, Bureau of Agricultural Economics 

At a business meetmg following the session of contributed papers, Professor 
F. M. Weida and Dr. John H. Smith were elected to succeed Colonel Kullback 
and Dr Madow as members of the Program Committee. 

The program for Saturday morning was devoted to the following invited 
lectures: 

1. Recent Developments in the Measurement of Simultaneous Economic Relations. 

T. Koopmans, Cowles Commission for Economic Research 

2. StructiiTol Estimation versus Regressions’ use for Policy and Prediction, 

Leonid Hurwicz, Cowles Commission for Economic Research 
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The program for Saturday afternoon was devoted to the following: 

1. Basic Concepts Underlying Sequential Analysis with Applications, 

A. Wald, Columbia Univeraity. 

2. Applications of Sequential Analysis to Acceptance Inspection. 

W. B. Pabat, Navy Department 

Irving Siegel, Veterans Administration, was chairman for the morning session 
and Professor F. M. Weida, George Washington University, for the afternoon 
session. 

A lively discussion followed the presentation of the papers. 

S. Kullback, 

Secretary, Washington Chapter. 




SAMPLE CRITERIA FOR TESTING EQUALITY OF MEANS, EQUALITY 
OF VARIANCES, AND EQUALITY OF COVARIANCES IN A 
NORMAL MULTIVARIATE DISTRIBUTION 

By S. S. Wilks 
Princeton University 

Summary. In this paper statistical test criteria are developed for testing 
equality of means, equality of variances and equality of covariances in a normal 
multivariate population of k variables on the basis of a sample. More spe¬ 
cifically, three statistical hypotheses are considered; (i) Hmve, the hypothesis 
that the means are equal, the variances are equal, and the covariances are 
equal, (ii) Htc , the hypothesis that variances are equal and covariances are 
equal, irrespective of the values of the means, and (iii) H„, the hypothesis of 
equal means, assuming variances are equal and covariances are equal. 

Test criteria Lmvc, L^c, and Lm are developed by the Neyman-Pearson method 
of likelihood ratios for testing and Hm respectively. The exact 

moments of each of the three test criteria when the three corresponding hypoth¬ 
eses are true are determined for any number k of variables and for any size, 
n, of the sample for which the distnbutions exist The exact distributions of 
Lmvc and L,c are determined for fc = 2 and k = 3, and the exact distribution of 
Lm is found for any k; these are all beta (Pearson Type I) distributions Tables 
of 5% and 1% points of , L«e and Lm, based on Thompson’s tables of 
percentage points of the Incomplete Beta Function, are given for certain values 
of k and n (Tables I and II). Also tables of values of approximate 5% and 1% 
points of —n In , --n In L„c and -•n(k—l) In L™ for large values of n are 
given (Table III), based on the fact that these three quantities are approximately 
distributed according to chi-square laws for large values of n with ^k{k -f- 3) —3, 
^kik -f 1) — 2, and k — I degrees of freedom respectively. A table (Table IV) 
is given which shows how accurate the resulting approximate 5% and 1% points 
of L mvc j Le and Lm are 

The paper is written in two parts. In Part I the problem of testing the three 
hypotheses is discussed and the mathematical results are presented together 
with an illustrative example. Part II is given for the reader who wishes to study 
the mathematical derivation of the results. 

I. The Problem and a Statement of Results 

1.1. Introduction. Situations occasionally arise, m which it may be desired 
to test the hypothesis that the means are equal, the variances are equal and the 
covariances are equal in a multivariate population m which the variables are 
correlated, the test to be made on the basis of a sample from such a population. 
In the case of a normal multivariate distribution this means testing the hypo¬ 
thesis that the distribution is symmetric with respect to the variables. 
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As an example* suppose three “parallel forms” of a test are constructed and 
all are given to a group of n college entrance studentB. On the basis of the 
scores of the n students on the three tests, how could one test the hypothesis 
that the three tests are really parallel forms, as far as means, variances and 
covariances are concerned? In other words, how could one test the hypo- 
thesis that the scores can be regardetl as being from a sample of individuals 
from a college entrance population of individuals in which the distribution 
function of the three variables is such that the means of the three variables are 
all equal, the variances are equal and the covariances are equal? Actually, ns 
far as practical considerations arc concerned in testing work, it is frequently 
sufficient to consider only normally distributed populations So therefore one 
may raise the question as to how to test the hypothesis that the three-variable 
sample can he considered as having come from a normal three-variable popula¬ 
tion which is symmetrical in the three variables, i.c. a normal population in 
which the means are equal, the variances are equal, and the covariances are 
equal. Or more generally, one may raise the analogous question, for the case 
of I, variables. 

Similarly, one could mention biological examples which have been treated by 
intra-olass correlation methods and raise the question as to whether the under¬ 
lying multivariate distribution can be judged to be symmetric in the variables 
on the basis of information supplied by the sample. 

To attempt to deal with this problem by comparing means, or variances or 
covariances two at a time or performing what might appear to Ire extensions of 
existing tests for two or more independeni samples of one variable leads to com¬ 
plications because of correlation among the variables in the original population, 
What is needed is some kind of a comprehensive test which will take into account 
all means, variances and covariances at one time. If it turns out that the hypoth¬ 
esis of equal means, equal variances and equal covariances is not supported 
by the sample, then one can raise the question as to whether the sample supports 
the hypothesis that the variances are equal and covariances are equal irrespective 
of means. If the answer is yes here, one can ask the further question as to 
whether the sample supports the hypothesis of equal means. Such tests will be 
developed in this paper for samples from a normal multivariate population. 
More specifically three tests are developed, (i) Test for testing the hypoth¬ 
esis that all means are equal, all variances are equal and all covariances 
are equal, (u) test Avc for the hypothesis Hve that all variances are equal and 
all covariances are equal, irrespective of the values of the means, and (iii) test 

* The problem treated m this paper arose from diBcuBBioiia Professor Harold 0. 
GuUikseo, of the Psychology Department of Princeton University, in connection with the 
problem of testing whether two or more forms of an examination can be considered as 
"parallel forms”. The author would like to take this opportunity to acknowledge various 
helpful discussions he has also had with his colleague Professor John W. Tukey in con¬ 
nection with this paper 
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Lm for the hypothesis that the means are equal, assuming that is true, 
i.e. that the variances are equal and the covariances equal. 

There aie rather obvious extensions of the hypotheses and 

’and then corresponding test criteria. For example, one could divide the vari¬ 
ables in the multivariate population into two sets, and consider the hypothesis 
Hmvc (say), analogous to H'^ve , that the means are equal, the variances are equal 
and the covariances are equal within each of the two sets and that the covariances 
of variables between the two sets are all equal. Similarly, and Zfm' could 
be defined so as to be analogous to and . However, these extensions 
will not be considered in this paper 

In Part I of this paper we shall discuss the problem of testing hypotheses 
regarding equality of means, equality of variances, and equality of covariances 
in a normal multivariate population, and summarize the mathematical results 
which have been obtained. An illustrative example will also be given. The 
derivation of the test criteria and their sampling theory is presented m Part II 
of the paper. 

1.2. The hypotheses to be tested. We assume that theie is a fc-vaiiate 
population n in which the variables Xi, Xi, • • • , xk are distributed according to 
a normal fc-variate piobability density function such that the mean value of 
18 a, {i = 1, 2, • • ,k) and the variance-covariance matrix oi Xi, X 2 , ■ • • , Xk 
is 11 Putr.cfi 11, pv, being the correlation coefficient between a;< and x,{i 7 ^ j), and 
(T, bemg the standard deviation of x.. 

In specifying the hypotheses to be considered it will be convenient to define 
three conditions on the parameters of population 11: 

Condition Cm- that the means of the x, are all equal. 

Condition C,: that the variances of the x, are all equal. 

Condition Cc- that the covariances of the x, and x, {i 7 ^ j) are all equal 
The hypotheses regarding 11 to be tested are as follows: 

Hmvc- that conditions , C,, and Cc hold 

Ht,c: that conditions C„ and Cc hold 

Hm: that condition C,„ holds, assummg that is true. 

A precise statement of these hypotheses in terms of Neyman-Pearson likeli¬ 
hood ratio terminology will be found in Part II. 

It should be noted that Hr^vc is a comprehensive hypothesis which specifies 
equality of means, equality of variances and equality of covariances and would 
be tested if one is interested m all of these quantities as a system. On the other 
hand Hvc refers only to equality of variances and equality of covariances re¬ 
gal dless of what values the means may have. i?„c would be tested if one is only 
concerned with equality of variances and equality of covariances. Hm is a more 
restrictive hypothesis than either or , for it refers to equality of 

means under the assumption thsA H^c is true. In other words, Hm can only be 
tested accurately when H^c is true, Hm would be a generalization of therBehrens- 
Fisher problem [1] when is false. 
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1.3. The sample test criteria. The three hypotheses and H„ are 

to be tested on the basis of a sample 0„ from TI consisting of the following values 
of the x’s'. x,a, i = 1, 2, ■ ■ ■ , k, a == 1, 2, ■ • ■ , n. 

The criteria for testing , and H„, depend on the following quantities 

to be determined from the sample: 

(1.1) = « = 


(1.2) 

(1.3) 


1 -K ' 1 


n ft -1 


72< a**l 


-It/- 


8^r = 


kik - 1) 


2 " 


The sample criteria, baaed on the method of likelihood ratios, for testing 
Hmvc , ri'un and H„ are respectively, as follows: 

(1.4) Lnva = Z/»«‘TSr^ 


( 1 . 6 ) 

(1.6) 


_Kd_ 

~ (sVd - rr\l + (k- l)r) 

_8^(1 — r) _ 

a'(l -r) + 2 {Si - Sf 

/c — i i—i 


where 1 s,y ] is the determinant of sample variances and covariances. 

The range of values of each of the three criteria is from 0 to 1. A necessary 
and sufficient condition for each criterion to have the value 1 is that the hypoth¬ 
esis for which the criterion is a test be (accidentally) identically supported 
by the sample. If the hypothesis (any one of the three being considered) is 
true, the average value of the corresponding criterion will be less than 1, but 
this average value will be nearer 1 than when the hypothesis is false. 

If H„„c is true (i.e., found to be supported by the sample on the basis of the 
test Lmrc) then there will be three parameters which characterize 11, namely, a 
(the common mean), x (the common variance), and p (the common correlation 
coefficient). The best estimates of these three parameters are, respectively: 

£ ~ j. 2 » 

K i-1 


(1.7) 



To 



1 

k{k - 1) 



If is true (i.e , found to be supported by the sample on the basis of the 
test L„o) there will be A: 2 parameters which characterize IT, namely the means 
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fli I 02 ) , 0-k, 0 ^ (the common variance) and p (the common correlation coeffi¬ 

cient). The best estimates of these parameters are, respectively 

(1.8) , ^ 2 , • ■ • , xj., s’, and r. 

In order to be able to use the three sample criteria Lmve, Lvc and L„ for testing 
the hypotheses Hmve, H^c , //» , it is necessary to have their distribution func¬ 
tions under the assumptions that the respective hypotheses Hmvn, Hve and Hm 
are true. 


1.4. Sampling theory of the test criteria. The moments of the exact sampling 
distributions of Lmvc and L„c when Hmvo and Hvc are true respectively, have been 
determined for all values of k (number of variables) and all values of n (sample 
size) for which such distributions exist; ie., for fc > 2 and n > k. The ^-th 
moments of the distributions of the two criteria are as follows: 


(1.9) 


M,(L„.,) = (fc - 1)«'‘-’' 


r(^(n — i) + g) 

fi r(K» - t)) 


_ r(K^ - i)») 

ra(ft - l)(n -1) + g{k - D) 


and 


( 1 . 10 ) 


MM = (k- I)"'*-'’ 


k 



r(K» - i) + g) 
r(Kn - i)) 


r(Kfc - i)(n - 1)) 
r(Kfc - l)(n - 1) + g{k - !))■ 


For the cases of fc = 2 and fc = 3, these moments simplify so that the distribu¬ 
tion functions of Lmvt and L„c can be readily inferred. They turn out to be as 
follows: 

For fc = 2: 

(1.11) dF{M = i(n - 2)(Iw„)*'"-‘>dL„,. 

(1.12) dF(L..) = (1 - L,c)"* dU ,. 

Virr(§(n - 2)) 

For fc = 3: 


(1.13) dFM = (1 - dM, 

dF{U.) = (Vl7.)"-*(1 - VZ:.) . 


(1.14) 



262 


B. S, WIIiKS 


The distributiofl funetion of when the hypotheaiB is true has been found 
to be 


(1.15) 


dF{L^) = 


n\n{h - D) 

Vilin -\){k- i))r(Kfc - 1)) 




dL„. 


Details of the dei-h ation of these distribution functions will be found in Part 
II. 

In a paper published elsewhere in the present issue of the Annals of Mathe¬ 
matical Statistics, Tukey and Wilks [2] show how the probability integrals of 
Lm„c and L„c and of other statistical criteria haring moment.s of a rather general 
class can be fitted by Incomplete Beta Functions in such a way that all moments 
of the fitted distribution agree with those of the actual drstiibution up to and 

including terms of oidei - , 
n 

It will be noted that the probabihtj' integrals of L,„„c and L„o for k = 2, those 
of -s/hmvc a.nd VZ^c lor k = 3, and that of L,,, for any value of k, are Incomplete 
Beta Functions [3], with the following values of jj and q: 


k 

criterion 

p 

s 

2 

.^TrtU c 

i(n - 2) 

1 

2 

IjVG 

Un - 2) 

h 

3 

"V^ Lnitic 

n — 3 

3 

3 

y/ L»j 

n — 3 

2 

fc 

Lfti 

1 

rH 

1 

Kfc - 1) 


Percentage points” of the distributions of these criteria for the cases men¬ 
tioned in this table can therefore be read from Thompson’s [4] tables of per cent 
points for the Incomplete Beta Function 5% and 1% points for Z/mr= and L„„ 
for fc = 2 and 3 are given in Table I for certain values of n. Table II shows 
5% and 1% points of Lm for certain values of a for fc = 2, 3, 4, 5 and 6. 


1.6. The equivalence of Lm and an analysis of variance test for a fc by n lay¬ 
out. One can set up a Snedecor F ratio for testing hypothesis Hm by setting 

(1 161 F = - l)(fe - 1)(1 - Lm) 

^ iik- DLm 

and entering the F tables with Wi = fc — 1 and na = (n — 1) (fc — 1) degrees of 

’ The 100f% point, say L« , of a given criterion L (any of those being considered) having 
distribution dF{L) is given by f * dF(L) = f 



SAMPLE CKITERIA 


263 


TABLE I 


B% and 1% points of Lmvc and L^cfor k = 2 and k = 3 




k = 

2 


k = 3 

TL 

VC 

L 

VO 

n 


L 

c 

5% 

1 % 

5% 

1 % 

5% 

1 % 

S% 

1 % 

3 

0 0025 

.0001 

0.0062 

.0002 

4 

0.00029 

0.00001 

0 00064 

0 00003 

4 

0500 

0100 

.0975 

0199 

5 

.0095 

.0018 

0183 

.0035 

5 

.1357 

.0464 

.2285 

0808 

6 

.0358 

.0112 

0618 

.0198 

6 

.2236 

.1000 

.3416 

.1588 

7 

0736 

0300 

.1174 

0493 

7 

3017 

.1585 

4307 

.2352 

8 

.1165 

.0559 

,1749 

.0866 

8 

.3684 

2154 

.5005 

.3039 

9 

1603 

0860 

2297 

.1272 

9 

.4249 

.2683 

5559 

.3637 

10 

.2028 

1181 

2802 

.1682 

iO 

.4729 

.3162 

.6007 

4154 

11 

,2432 

.1508 

.3259 

.2079 

11 

5139 

.3594 

6375 

4601 

12 

.2808 

.1829 

3670 

2457 

12 

.5493 

.3981 

6682 

.4989 

13 

3157 

.2141 

,4040 

.2811 

13 

.5800 

4329 

.6943 

.5328 

14 

3480 

2439 

.4373 

.3141 

14 

6070 

.4642 

7165 

.5626 

15 

.3778 

.2722 

.4674 

3448 

15 

6307 

4924 

7358 

.5889 

16 

4052 

.2990 

.4946 

3732 

16 

6518 

5180 

.7528 

6124 

17 

.4306 

.3243 

.6193 

.3996 

17 

.6707 

5411 

7675 

6334 

18 

.4540 

.3482 

.5418 

.4240 

18 

,6877 

.5623 

.7807 

.6522 

23 

.5484 

.4482 

.6293 

.5230 

19 

.7030 

,5817 

7925 

.0693 

33 

.6660 

.5811 

7326 

.6470 

20 

.7169 

.5995 

.8031 

.6848 

63 

.8135 

.7591 

.8549 

.8029 

21 

.7294 

.6159 

8126 

6989 

QO 

1.0000 

1 0000 

1 0000 

1.0000 

22 

.7411 

.6310 

.8213 

.7119 






23 

.7518 

.6450 

8292 

7237 






24 

7616 

.6579 

8365 

7347 






25 

7707 

.6700 

8431 

.7448 






26 

7791 

.6813 

8493 

.7542 






27 

7869 

.6918 

8549 

7629 






28 

.7942 

.7017 

,8602 

7710 






29 

.8010 

.7110 

.8661 

7786 






30 

8074 

7197 

8697 

.7857 






3] 

.8133 

.7279 

.8739 

.7924 






32 

8190 

7356 

8779 

7987 






42 

8609 

7943 

.9073 

.8454 






62 

.9050 

.8577 

9375 

.8945 






122 

.9513 

.9261 

9684 

9460 






OO 

1.0000 

1.0000 

1 0000 

1 0000 
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TABLE II 
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freedom. Making use of the definition of s*, sj, r and ro in Lm , one finds that F 
can be written as 


(1.17) 


F = -^ / — 
(fc - 1) / (n - 




m -1) 


h n k 

where >Si = n'^ {Xx — xf, and Sj = £ S (a:.a — - x. + xf and 

V—-1 a«^l 

1 ” 

x't, = X,a . Thus, the use of Lm as a criterion for testing Bm is equivalent 

rC aatl 

to an analysis of variance test for testing “row” effects in a A: by n rectangular 
layout when rows are associated with the k variables in the multivariate popula¬ 
tion and columns are associated with the n individuals in the sample. 


1.6. Approximate sampling theory of the test criteria for large samples. In' 
the case of large samples, it follows from a theorem [5] concerning the distribution 
of likelihood ratio criteria for large samples that —n hi Lmvc, —nhiLtc, and 
—n(k — 1) In LmBXG approximately distributed accordmgto chi-square distribu¬ 
tions with \k{h -f 3) — 3, ^k{k -1- 1) — 2, and fc — 1 degrees of freedom respec¬ 
tively. Approximate 5% and 1% points of these three quantities taken from 
Thompson’s [6] tables of the percentage points of the chi-square distribution 
are given in Table III 

Table IV is given in order to furnish some idea of how the accuracy of the 
approximations provided by Table III depend on n. It will be noted that the 
approximate values exceed the exact values in every case, differences occuring 
in the third decimal place in almost every case in which n exceeds 60. The ap¬ 
proximate percentages to which the approximate per cent points correspond 
are given by the numbers in the parentheses in Table IV These numbers in 
each case were obtained by linear interpolation from the exact 5% and 1% 
points 


1.7. Comparison of L„c with Mauchly’s “sphericity” test. The criterion 
Lvc for testing hypothesis is, in a sense, an extension of a teat developed by 
Mauchly [7] for testing the hypothesis of “sphericity” of a normal multivariate 
distribution Mauchly’s test was designed for testing the hypothesis that all 
variances are equal, and that all covariances are equal to zero irrespective of the 
values of the population means The likelihood criterion for testing this hypoth¬ 
esis of “sphericity” is 


(1.18) 


L. = 


(sy 


which should be compared with L^c • Actually, Mauchly used L, as the test 
criterion, which, of course, is equivalent to using L.. The p-th moment of L, 
when the hypothesis of sphericity is true is given by 


(1.19) 




^ r T 

■n ' 

t-i L 


r(Kn - 0 + g)\ r(P(n - i)) 


r(K^ - *■)) J r(P(n - 1) + gk) 
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TABLE III 


Approximate 5% and 1% poinlft for —nlnLmvc, —nhxljot, and ~n{k — i)InL„ 

for k = fi, 4, S} S. 


k 


— nlnL 

m DC 


" nln t 



~n(fc-l)ln/:,„ 

d.f. 

5% 

1% 

d.f. 

6 % 

6% 

df. 

5% 

1% 

2 

2 

5.99147 


1 

3.84146 


1 

3.84146 

6.63490 

3 

6 

12 5916 

16.8119 

4 


13.2767 

2 

5.99147 

9.21034 

4 

11 


HI 

8 



3 

7.81473 

11.3449 

5 

17 

27.5871 


13 

22.3621 


4 

9.48773 

13.2767 

6 

24 

36.4151 

42.9798 

19 



5 

11.0705 

15.0863 


TABLE IV 


Table indicating the accuracy of the approximate B% and 1% points of Lmvc, 
and Lm provided by Table III 


criterion 

k 

n 

6% 

1% 

exact 

approx. 

exact 

approx. 

Ljnve 

2 

30 

0.8074 

0.8190 (5.53)* 

0.7197 

0.7367 (1.73)* 

Lmvo 

2 

62 

.9050 

.9079 (5.25) 

.8577 

.8619 (1.36) 

Lmvc 

2 

122 

.9513 

.9621 (5.13) 

.9261 

.9273 (1.19) 

Ljnvc 

3 

33 

.6660 

.6828 (5.79) 

.5811 

.6008 (1.88) 

Lmve 

3 

63 

.8135 

.8188 (5.40) 

.7591 

.7658 (1.49) 

Live 

2 

30 

.8697 

.8799 (5.49) 

.7857 

.8016 (1.76) 

Lpe I 

2 

62 

9375 

,9399 (5.22) 

.8945 

.8985 (1.37) 

Lfc 

2 

122 

.9684 

.9690 (5.11) 

.9460 

.9471 (1.20) 

Lpe ' 

3 

33 

.7326 

.7501 (5.82) 

.6470 

.6688 (2.01) 

hte 

3 

63 

.8549 

.8602 (5.41) 

.8029 

.8100 (1.55) 


2 

31 

.8779 

.8835 (5.28) 

.7987 

.8073 (1.43) 


2 

61 

.9375 

.9389 (6.13) 

.8945 

.8969 (1.20) 

■^frt 

2 

121 

.9684 

.9688 (5.07) 

.9460 

.9467 (1.13) 


3 

31 

.9050 

.9079 (5.25) 

.8577 

.8619 (1.36) 

L. 

3 

61 

.9513 

.9521 (5.10) 

.9261 

.9273 (1.14) 

L„ 

4 

41 

9372 

.9385 (5.19) 

.9101 

.9119 (1.26) 


5 

31 

.9246 

.9264 (5.26) 

.8961 

.8984 (1.32) 


*The numbers in the parentheses are approximate percentages (obtained by linear 
interpolation) to which the approximate percent points correspond. 


which should be compared with the g-th moment of . Stated in other words, 
Mauchly’s criterion L, is a test for the hypothesis that contours of equal proba- 
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bility density m the multivariate normal population distribution are spheres, 
while Lvc is a test for the hypothesis that the contours of equal probability are 
fc-dimensional ellipsoids with fc — 1 equal axes in general shorter than the fc-th 
axis which is equally inclined to the k coordinate axes of the distribution 
function. 

1.8. Illustrative Example. As an example to illustrate the use of the test 
criteria Lmvc, L^c, Lm, we shall consider data on three forms of a subtest in 
verbal aptitude, and inquire as to whether the data are consistent with the 
hypothesis of the three forms being “parallel forms”. 

A procedure® was used for partitioning the first 60 of an entire test of 80 items 
mto three sets of 20 items each by using only a “difficulty ’ and a “validity” 
index on each of the items. A random sample of 100 test booklets was selected 
from those in which the first 60 items had been attempted. Total scqres were 
obtained on each of the three subtests selected in this manner. The question 
IS this: Does this procedure of selecting items produce “parallel” subtests? 
In other words considering the three scores on the three subtests in each of the 
100 test booklets as a sample of 100 items from a trivariate normal population 
is the sample consistent with the hypothesis iimve of equal means, equal variances 
and equal covariances? If not, is the sample consistent with the hypothesis 
of equal variances and equal covariances irrespective of means? If the answer 
to this question is no, then the failure of the tests to be parallel is at least partially 
attributable to differences m variances and/or differences in covariances. If 
the answer to the question is yes, we test Hm , the hypothesis of equal means, 
assummg equal variances and equal covariances. If the sample is not consistent 
with Hm , then the subtests fail to be parallel because of significant differences in 
means. 

If we denote the three subtests by Ti, Tt, Tt, and the scores on the a-th 
individual in the sample on the three tests by xia, respectively (a = 

1, 2, • • , 100), the information in the sample needed for computing Lmvo 

and Lm and testing H-^vc, and is as follows; 


Xi = 10 9900 

s® 

= 17.5558 

X2 = 10 9300 

So 

= 17.5764 

ra = 11 2600 

r 

= 7963 

sii = 16.8451 

To 

= .7948 

S 22 “ 18.1099 

1 s.tl 

= 545.5308 


sj3 = 17.7124 
fii2 = 13.5493 
Si3 = 14.5826 
S23 = 13 8056 

’Devised by Mr L.R Tucker of the College Entrance Examination Board. The author 
is indebted to Mr Tucker for the data used in the illustrative example. 
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Using formulas (1.4), (1.5), and (1.6), for * = 3, for calculating the values of 
Lmvc , Lvc and Lm, we find 

Lmvc ~ .9209 
L,, = .9370 
L„ = .9914 

It will be seen from Table III that the 6 % point of —n In L™,,, for 
Ifc = 3 IS 12.5912. Setting — lOO In = 12.5912 and solving we find the 
approximate 5% point of to be .8817 which is considerably less than the 
observed value of Lm,o, namely .9209. Hence, the sample is consistent with 
Tfmuc. As a matter of fact the observed value .9209 lies at approximately 
the 25% point of L miic ■ 

In practice, there would be no point in proceeding to test H„c or H„, because 
if Ln„o is non-significant there is a high probability (not certainty) that both L„ 
and Ln will be non-significant. But for illustrative purposes, it is perhaps useful 
to consider L^c and Lm anyway. 

The 5% point of —n Irt L,, for /o = 3 is 9.48773 (See Table HI), Setting 
— 100 In 7j„e = 9.48773 and solving, we get .9095 as the approximate 5% point 
of Lvc, which is considerably less than the observed value .9370, thus indicating 
that Lyc is not significant at the 5% level. In fact the observed value .9370 
lies between the 25% and 10% pomt of L,». 

The 6% point of —n(k — 1) In Lm for lb = 3 is 6.99147. Setting — 200 In Lm = 
6.99147 and solving we get .9704 as the approximate 5% point. Since the ob¬ 
served value of Lm exceeds .9704, we find Lm not significant at the 5% level. In 
fact, the observed value .9914 lies between the 60% and 25% points. 

II. Derivation or Results 

In this part we shall derive the criteria L«,,, Lv» and Lm for testing Hmti ,, 
Hyc and Hm by the Neyman-Pearson method of likelihood ratios, and determme 
the distribution theory of the criteria. 

2.1. The test Lm^^for Hmvy, the hypothesis of equality of means, equality of 
variances and equality of covariances. 

2 .1.1 Denvattm of the enterion Lm»o. Let n be a normal fc-variate population, 
in which xi,xt, ■ ■ • , a;* are variables, such that a, is the mean oixi, a] the vari¬ 
ance of X, and the covariance (p,^ the correlation coefficient) between 

Xi and Xj The distribution law of xi, a;j, • • ■ , a:* in the population, is 

^(^^ Ai,{xi - a.)(a:; - a/) j 

where (j Av/ jj is symmetric and is the mverse of the variance-covariance matrix, 
i.e. llAiiir' = |lp.,ff,ff,|l, (p., = 1 ).- 

Now suppose Ob is a random sample of n individuals from population II, 
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and let iio be the value of the for the ath individual in the sample, 
the probability function for the entire sample (likelihood function) is 


( 2 . 2 ) 





Then, 


The hypothesis which we wish to test is that the population means oi, 
Oj, • • • , a* are equal, the variances <r\, al, , 0 -* are all equal and the covari¬ 
ances PwtritTi, pizoidz , ■ • ■ , Pi-i.uri-iff*; are all equal, the test to be made on the 
basis of the sample of values x.a • In other words, we wish to test the hypothe¬ 
sis that 


(2.3) 


( fli = Oj = ■ • • = Oj = o 


Tl 2 




2 

2 

2 

<ri 

pijtriffj • • 

Put O'! Ok 


<7 

pa ■ 

- pa 


2 



2 

2 

2 

p21 ffi o-j 

ffs 

P2k trs 


ptr 

<T 

• pa 


• 

2 


2 

2 

2 

Paffiff* 

pia<^2Ck *• 

Ok 


pa 

pa • 

.. cr 


Testing the hypothesis that (2.3) holds is equivalent to testing the hypothesis 
that 


oi = Os 


— Oh = a 


(2.4) 


where 


All Aij • • • All, 


A B “ 

• B 

A 21 A 22 


B A ■■ 

• B 

All All 


B B 

■ A 


(2.5) 


1 + (k- 2)p 

Al -p)(l + (fc-l)p)’ 


B = 


__ 

a^(l ~ p)(l + (fc - l)p) • 


To obtain the likelihood criterion for testing the hypothesis Hmna we 
maximize the likelihood (2.2) under two conditions, for the given sample 0„, 
and take the ratio of the two resulting maxima. Fu'st, we maximize (2.2) over 
the set SI of admissible values of the parameters, i e with respect to all means 
fli and all variances and covariances p^,(Tx<Tj , denoting the resulting maximum 
of (2,2) by Pa • Secondly, we maximize (2.2) over the set of values w of the 
parameters which satisfy the hypothesis 'Hmv <:; that is, we replace in (2.2) each 
mean Oi, Oj, ■ ■ ■ , a*, by a, and each of the variances a-\, trl, cl hy <t^ and 
each of the covariances pxjCxC,, (t j), by pv* and then maximize (2.2) with re¬ 
spect to a, 0 - , and p, denoting the resultmg maximum by . 



270 


H. ti. WILKS 


Maximizing (2.2) under the first set of conditions is eriuivalent to maximizing 
it with respect to the a., and tlie A.j, while maximizing (2.2) under the second 
set of conditions is equivalent to imposing condition (2.4) and maximizing it 
with respect to a, A and B. 

The valued of the a, and d.,, which maximize (2.2) under the first set of condi¬ 
tions are given by solving the following (A:* + 3A:)/2 cciuations. 


(2.6) 

dP 

3a, 

= 0, i 

= 1,2, 

• ,fc 


(2.7) 

-^ = 0 
dA., ’ 

h i = 

1.2, 

, k, a < 3). 


Expressions for these equations are 




(2.8) 

■" * 

L 3-1 

-a.)]p 

= 0. 

i = 1,2, ■ ■ ■ , k 


(2.9) 

4 

1 

1_I 


“')> - 

0, i,j, = 1,2, ••• 

. k,(i < j), 

where 

A*^ is the element in the fth row 

' and jth 

column of || ||~\ 

i.e. 

a‘^ = 

P(;(r,(rj, and 

71 a 





The 

solution of (2.8) and 

(2.9) is 




(2.10) 

O/ = 


,3, , 

k 



A^^ = s, 

ij, or Ai, 

= 3*^, 

II 

(* < i) 


1 ” 

where Stj = - 2(x.a — £/)(x,a — £j), and where || s*'’|| = || s;y||.'"\ In- 

n anl 

sertmg the values of (2.10) in (2.2) and noting that the exponent in (2.2) re- 
duces to — ^ Zj which in turn reduces to —^/cn, since 22 — 1 

'.j“i ,-,1 

for each value of j, we obtain 


VO = I- li -T f - . 

I s,j P"(2 t)**" 

In order to obtain Pu, we specialize the a, and the matrix || Ajj || in (2.2) 
in accordance with (2.4), noting that the determinant | A ,-/1 reduces to 
(j4 — B) + (k — 1)B), thus obtaining the following specialized form 
of (2.2) 


(2.12) p/ - KA - Bf-\A + (fc - DB)]*" 

(2x)*"‘ 


f r ^ n k 

exp < A22 22 (a;» - a)“ + S E S («.« - - a) 

\ I— i«l eval _ 
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The values of a, A and B which maximize P' are given by solving the following 
three equations 

U- 0 - S'-”- 

These equations are respectively 

\\iA - - a) + BE (E (a:.„ - a))'] P' = 0 

L a-l •-! ■»-! \*-l / J 


(2.14) 


J 

\l~r^ 


B ' A + ik-l)B 


~ it, E (^.a - af^P' = 0 

a“»l »=»1 _ 


[[=T^ + atI^b - 5 - “)] 

1 " 

Replacing a, by x, in (2.16) putting - E (^»-> ~ — Xj) = s,,, and 

n a-l 

setting 

’’ H n k 

X = E S *•« 

tl/C 1-1 
1 ” 

SOv, = - E (®»« - £)(^ja - X) = «„ + (X, - X)(X, - X) 
n 

k k 

(2.15) < ro = E so„/(/c - 1) E so» 

1-1 

= r E s.j — E (^1 - x)*1 j (fc -1) Te s.i + 2 (®. - x)“1 

1 «—1 J/ L*™! '“1 J 

So = E Sot.A = r Fz) s.. + E (^1 - 

^ 1-1 K Li-1 1-1 J 

we obtain as solutions of (2.14) 


_ 1 -b (^ — 2 )t'q _ 

So(l - r)(l + (fc - l)ro) 


B = 3 


5?(1 - 7-„)(l + (fc - l)ro) 


Substituting these in (2.12) we obtain 


[(s§)'‘(l - ro)"-' (1 +(/c-l)ro)]^"( 27 r)‘"" • 
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The likelihood ratio for testing hypothewis Il„„c is given by 




It will be convenient to use the - th root of Xmire an the test criterion for . 

n 


Denoting this criterion by Lnva, we have 


( 2 . 18 ) 




8,y 


(8?)'(1 - ro)*-^(l + (fc - l)ro) • 




The use of L„,, as a test criterion is obviously equivalent to the use of . 

It will be seen that Lm»c is equal to unity when and only when the sample 
means are all equal, the sample variances Sa are all equal and when the 
sample covariances s,/, (i 9 ^ j), are all equal. The greater the departure of 
sample means from equality, sample variances from equality and sample co- 
variances from equality, the smaller will be the value of L„«c, its value, of course, 
always remaining between 0 and 1. 

2,1.2. Approximate dtstnbution of —n In Lmvi, in large samples. In order to 
make use of Lmvc as a criterion for testing hypothesis Hmvo we must find its 
sampling distribution under the assumption that Hmvc is true, i.e. that our sample 
has, in fact, been drawn from a fc-variate normal population having equal means, 
equal variances and equal covariances. In the case of large samples, it follows 
from a theorem on asymptotic distributions of likelUiood ratios [6] that — 2lnXm,o 
(i.e, -n In L„„o) is approximately distributed according to the chi-square law 
with ^k{k 3) — 3 degrees of freedom (obtained by taking the difference ber 
tween the number of parameters used in maximizing P to obtain Pa and that 
used in maximizing P' to obtain Pu). 

Thus, to apply the test, one computes the value of —n In L^ve for the given 
sample, and sees whether the obtained value is significant at the given probability 
level (5% or 1%) using the chi-square table for ^k{k + 3) — 3 degrees of freedom. 

To make a study of how closely the chi-square distribution approximates the 
exact distribution of — n In for various values of fc and n would be an ard¬ 
uous task in computation. But existing experience with approximations to large 
sample distributions mdicates that the • approximation in the present problem 
would be satisfactory for small values of h (say not more than 6) and values 
of n not less than 60 Some light is thrown on this question for k = 2 and 3 
by Table IV 

2,1 3. Moments of the exact dntribvlion of . In Section 2.1.2 an approxi¬ 
mation is given to the distribution of —n In for large samples. As a matter 
of fact, one can find expressions for the moments of the exact distribution of 
kimvc , which for the cases of fc = 2 and fc = 3 yield simple expressions for the 
exact distribution of 
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To find the moments of it will be noted that if one sets 


— di] 


— Qqij 

in expression (2.18) for Lmnc , the following expression is obtained for 

(2.19) L... = [^] 

where 


( 2 . 20 ) 


1 1 * 

= r 2 Ooi. - jjr-rr 2 

h <-i kik - 1) i^iLi 


OOij 




dou + 



It will be seen that Lmvc depends on the and the a,f . In the case of a sample 
from a general normal multivariate population, we know the o,^ to be distributed 
according to the Wishart [8] distribution function 


( 2 . 21 ) Wn-iMatr, A<,) 


Ati 


l“-’>|a„r-->expr-i £ 

_ L <,;-l J 

fl r(Kn - i)) 

1-1 


and the means i. to be independently distributed according to the normal dis¬ 
tribution 



where the and a, were defined m (2.1). 

We now define a function <p{g, u, v) as the mean value of | an [" when 

Hmvc is true, i.e , 

(2.23) v{g, % v) = E{\ a., "+'''■>) 


where the right hand side denotes multiplication of (2.21) by (2.22) (after im¬ 
posing condition (2 4)) by ] a,, I'e”®'’*’''®'' and then integration with respect to 
the o,, and 5,. This yields 


<p(g, w. y) 

(2.24J 


= 2"* IT r r(l(» - t) + g) 1 
i-iL r(Kn-t)) J 

^ (A- + (k- !)£)*'"-'’ 

^ 7~, I 2U • 

U - B - ^4ri) iA + {k~ l)B - 2t))‘t'-»+i' 
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Now the gth moment of L^vc is defined by 

(2.25) = E[{L„.cY] 

and is obtained by evaluating the partial derivative 


(2.26) 


ara-n+i 




at M = u = 0, and then putting r = —g and a = ~g. The validity of this 
operation for the range of values of g in which we are interested can be estab¬ 
lished by an argument based on analytic continuation. Alternatively, the same 
result can be achieved by taking the indefinite integral of <p r{k — 1) times suc¬ 
cessively with respect to n, and a times successively with respect to v (the lower 
limit of integration being — oo in every case) and then evaluating the final 
result at M = t) = 0. Accordingly, we obtain for the ^h moment of Lmoe, 
when hypothesis Hmtc is true, the following expression 


(2.27) 


Mail ) = TTr r(K« - i) + g) l 

ff( ..») 11 [_ r(Mn-O) J 

X(k~ r(i(n - l)r(in(fc - D) 

^ r(Kn - 1) + (7)r(in(fc - 1) -f ff(fc - 1)) • 


2.1.4. Diatributi'm of for k = 2 and 3. For k <= 2, the criterion Lmto 
can be expressed as 




«ii Su 


M I—■. m 


5 J 1 SZ 2 


■“^tnvo 

}(8u + ««) + i(^l — ^)’ 

81 J — 1(^1 — 


Sjl “ — ^a)* 

i(sn + 822 ) + 1(^1 ■" fe)* 


The gfth moment of L„„ for k = 2 (obtained by putting A: = 2 in (2.26) is 


(2.29) 


^^o(Envo} — 


r(§n)r(i(n - 2) -H p) 
r(^n + j7)r(Kn - 2)) 


(Hn - 2)) 

ih{n - 2) + g)’ 


and the distribution function of is found to be 


(2.30) dF(L„,;) = i(n - 2)L‘.tr’ , (0 < ^ 1). 

For fc = 3, can be written as 


Sll 

812 

8l« 

S51 

822 

823 

8jl 

Ss2 

S33 


(8o*)’(l - ro)*(l -f 2ro) 


(2.31) 
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■where si and ro are defined in (2 15) for fc = 3. Putting A; = 3 in (2.26) we 
find the gth moment of Lm„c for this case to be 

/o oo^ /r ^ r(l(« - 2) 4- (7)rCi(n - 3) + ff)r(n) 

(2 32) M.(L„.)-2 r(x(^_2)r(A(™_3)r(a + 2g)' ' 

By using the fact that 

r(/ + ^)r(< + 1 ) = y Vr (2^ _2^ ) ^ 

it is seen that M„(I;„„c) reduces to 

(2 331 M (L ) = •" 3 4~ 2g) 

from which we deduce the distribution of to be 

dP(L„.=) = _,-,y (\/Q’‘“‘(i - VLZ.)"dVL:::., 

(2 34) r(3)r(w - 3) 

(0 < L„.c < 1 ). 


For values of A: > 3, the exact distribution of Lmve seems to be too complicated 
to lend itself to ready computation. 

Thus, relatively simple exact tests of significance of Lmve can be set up for 
A = 2 and A = 3 by using distribution functions (2.30) and (2 34) respectively. 
For large values of n we have pointed out that the significance of L^vo can be 
tested by making use of the fact that — n In Lmyc is approximately distributed 
according to a chi-square law with ^A(A -f 3) — 3 degrees of freedom when Hmvc 
is tiue 

For A = 2, Lmvc is essentially a criterion for simultaneously testing, on the 
basis of a sample, the hypothesis of equality of means and equality of variances 
of a normal bivariate population 

It should be noted that if Hmvc is true, or more realistically, is supported by 
the sample as a result of applying test Lmvc, then population II is characterized 
by the three parameters a, ff and p in (2.3). The likelihood estimates of these 
parameters are x, So and To. 


2.2. The test L„c for , the hypothesis of equality of variances and equal¬ 
ity of covariances, irrespective of the values of the means. 

2.2 1 Derivation of the criterion ■ If, m testing hypothesis Hmvc by means 
of the criterion Lmvc, at a given level of significance, say <, a non-significant value 
of Lmvc is obtained, one states that the sample is consistent with the hypothesis 
Hmvc that all the population means are equal, the variances aie equal and the 
covariances are equal. Consideration of the Neyman-Pearson Type II error 
mvolved in this statement would be very arduous and involved and will not be 
attempted. On the other hand, if a significant value of is obtained, one 
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states that the sample contradicts the hypothesis H mfo with probability « of 
making a Neyman-Pearson Type I error. In this case it may be reasonable to 
inquire whether the sample would support the hypothesis if the variability 
due to the mqans were eliminated. In other words, we may inquire whether the 
sample supports the hypothesis of equal variances and equal covariances, 
irrespective of what values the population means may have. To obtain the 
likelihood ratio criterion L„o for testing wo maximize the likelihood (2.2) 
under the following two sets of conditions: First, with respect to the means aj 
and the variances and covariances p{j<Tt<Tj ; and Secondly, with respect to the 
means Oi and A and B, where A and B are obtained by imposing the condition 
on the matrix || An 1| specified in (2 14). The maximum of (2.2) under the first 
condition is given by (2.11). Denoting the maximum of (2.2) under the second 
set of conditions by P«', it is found, by a procedure similar to that used in finding 
Pa (given by (2.17), that P«< is given by 


(2.35) 


where 


(2.36) 


—|ti* 


[(s^)*(l - r)*-Kl + (A: - l)r)]»-'(2x)**'‘ 

* / * 
r = S / (^ — 1) 2 «.< 

s’ = X) 


The likelihood ratio X„ for testing fif,* is given by 


= [ 




(s')''(l - r)*-Hl + {k 


_r. 

- iwJ 


The test criterion which will be used for testing H,c is L»«, the -th root of 

71 

Xiic, i e.. 


(2.37) 


Ijvc — 


I 8./I 


(s’)*(l — r)*“*(l + (k — l)r)' 


2.2.2. Approximate distribution of —n In L,o in large samples. 

In'the case of large samples — n In !/»<, is approximately distributed according 
to the chi-square law with \kik 4-1) — 2 degrees of freedom when hypothesis 
is true. 

2.2.3. Moments of the exact distribution of . The moments of when 
H,c is true can be found by a method similar to that used in Section 2.1.3 for 
determining the moments of L„,c. For it will be noted that can be written as 


( 2 . 38 ) 


L - r I I 1 
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where 


(2.39) 

1 As 1 ^ 

R = rZa.. - E 

k {-1 k(k — 1) 

1 r * ^ T 


<s = r Z aw + E 0 ., , 

ft L<-i J 


from which it is evident that L,^ depends only op the a,,', whose distribution in 
the case of a general normal multivariate population is given by (2.21). We 
now define a function d{g, y, z) as the mean value of | under the as¬ 

sumption that Hta is true, i.e., 

(2.40) fl(ff,2/,2) = £(|o,y|V®+'®) 


where the value of the right hand side is obtained by multiplying (2.21) by 
I o.j I'e’'®''''®, then imposing the condition on || A,j || stated in (2.4) and integrat¬ 
ing with respect to the a,^. Accordingly, we find 


(2.41) 


Hg, y ,«) = 2 “‘ 


r(^(n — t) g) 

iA L ri(n — i) 


X 


{A - (A + (k- 1)B)“"~‘> 

\A-B- j (A -b (ft - \)B - 2*)*'’‘-'>+' 


The ^h moment M^{L^ of is obtained by evaluating the partial derivative 


(2.42) 


1 _ B 

dz‘ 


at y = z = 0, and then setting, r = —g and s = —g. These operations yield 


(2.43) 


Af,(L„) 


A r r(§(^ ~ i) + g) ~| 

fiL mn-i) j 


X (ft - 


r(Kn - i))r(Kfc - I)(n - 1)) 

TiUn -1) + g)Tm - l)(n - 1) + g{k - 1))’ 


2.2.4. Distribution of ij„ for k — 2 and 3. For ft = 2, L^c can be expressed 
as follows: 


L 


VO “ 


Sll «12 

_ I Sil Si2 _ 

i(Sll + ^ 22 ) fil2 

B 21 + sm) 


(2.44) 
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and the grth moment of is given by 
(2.45) 


M„(L„) = _l^r(^(n - 2) + g) 


r(K?i - 1) + g)r(^(n - 2)) 

from which ihe distribution of is deduced to be 

C2.46) dF(L„) - 


VirViUn - 2 )) 

For i; = 3, i„„ can be expressed as 


•■‘VC ) (0 < L„o < 1). 


(2.47) 


fjuo = 


Sn Si! 5 i 3 

S21 Si! S23 

S31 Sti S33 


^ , , (s^)’(l - r)^(l + 2r) 

where s and r are defined in (2.36) by setting k = 3. Setting /k = 3 in (2.43) 
we find as the gth moment of L„c ’ 

(2 48) M/y., ) = o^» r(Kn - 2) + g)r(^(n - 3) + g)r(n - 1 ) 

r(Kn - 2))r(Kn - 3))r(n - l + 2 g) ' 

Following the method by which (2.32) was reduced to (2.33), we find that the i/th 
moment of L„ reduces to 

(2.49) M„(L„,) = r(n - l)r(n - 3 + 2g) 

r(n - 1 + 2(;)r(n - 3) ’ 

and hence the distribution function of L«. for A: = 3 is 

( 2 . 60 ) dF(L..) = _ g-j (v'zre)"“'(i - vi:.) dvz:., (o < l„ < i). 

For higher values of k the distribution of is apparently too complicated for 
ready computation. But distributions (2.46) and (2.50) provide relatively 
simple significance tests for the cases A = 2 and 3, respectively. For large sarn¬ 
ies we re^k agam that a significance teat for L,, is provided by the fact 
^ m (i.e -n In L,,) is approximately distributed according to the chi- 
square law with ^fc(Ai + 1) - 2 degrees of freedom when is true. 

■ C ^ criterion for testing, on the basis of a sample, 

e hypothesis of equality of variances of a normal bivariate population, 
ani characterized by the parameters ai, a* , - ■ • , a* , v* 

_ p. The maximum likelihood estimates of these parameters are Hi, Xi, ■■■ 
Zk , a and r, respectively. 

for , the hypothesis of equaKty of means, when the 
variances are equal and covariances are equal. 
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2 3.1. Devivation of the cnierion . Suppose L^c, described in Section 2.2.1 
for testing H^c, the hypothesis of equal variances and equal covariances, does 
not have a significantly small value, thus indicating that the sample does not 
contradict the hypothesis Hvc • Then, assuming that the original test !/««« 
of ifmrc turned out to have a significantly small value, we may inquire as to 
whether the significance of Z/«.„o is due to the inequality of the population means 
0 ,, In this section we shall consider a criterion Lm for testing the hypothesis 
Hm that the means a, are equal, assuming that the variances are equal and that 
the covariances are equal. In this hypothesis we maximize the likelihood (2.2) 
under the following two sets of conditions: First, with respect to the Ci, A and B, 
where A and B are defined by the condition on |1 || given in (2.4); secondly, 

with respect to a, A and B where these parameters are specified by (2.4). The 
maxima of the likelihood (2.2) imder these two conditions are Pu>, and , 
given by (2.35) and (2.17) respectively. The likelihood ratio Xm is therefore 


(2.51) 


P. __ r (s^)*(l - r)*-\l + (fc - l)r) 
P<^' L(sJ)*(l - r«)*~‘(l + (fc - l)ro). 


Now it follows from the definitions of «*, , sS and ro, (2.15) and (2.36) that 


6*(1 + ik~ l)r) s s‘(l + (fc - l)ro) 


and hence we may write 
(2.52) Xj/" 


We can also express Xm " as 



(2.53) 



where Ro and R are defined by (2.20) and (2.39) respectively. 

It will be most convenient for our purposes to use Lm, the ^/n{k — l)]-th 
root of Xm . as the criterion for testing Hm , i-e. 


(2.54) 


Lm = R/Ro = 


s^l — r) 
so(l - ro) 


_ 8’‘(1 — r) _ 

5*(1 - r) 4- Z (S. - 

fC “ X tnl 


2.3 2. Approximate distribution of —n{k — 1) In Lm in large samples. 

In large samples — 2 In Xm (i.e., —n(fc — 1) In Lm) is approximately distributed 
according to the chi-square law with A: — 1 degrees of freedom. However, 
the exact distribution of Lm is relatively simple and will be derived. 

2.3.3. Exact distribution of Lm when Hm is true. We shall determme the dis¬ 
tribution of Lm by first finding the gth moment of Lm when Hm is true. For this 
purpose we set up the function 

(2.55) yPip. q) = F(e’’"+®"«) 
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where the mean value is taken when //« is true, i.e., when the a,- and || 4,j|| 
satisfactory conditions (2.4). Now R and iio arc functions of the o,y and X{, 
Hence, to find we multiply (2.21) by (2.22) by and impose 

conditions (2,4), then take the integral over the entire space of the o,y and xt. 
These operations yield 

,, , _ (A-.B)>"<*-» 

(2.66) ° ^ _ s _ _ B _ 


The ffth moment of L„ is obtained by performing the following differentiations 


(2.67) 



and then putting h = -g. These operations yield 


(2.58) 


M fL ^ - l)(fe -• 1) + g)r(^n(fc - D) 

r(Kn - l)(k - l))r(in(A: - 1) + p) 


from which the distribution of Ln (when Hm is true) is found to be 


dPiL.) -- 

(2.59) r(Kn-l)(k-l))r(KA:-l)) 

.(1 - L„)*'*”‘>“‘dL. . (0<I(„<1). 

Thus, we are able to make an exact test of significance of Lm on the basis of 
the function (2.59) 


2.4. Relations between Lm«c > acd Lm • 

It will be seen from the definitions of L„v^, L,„ and L„ in (2.18), (2.37) and 
(2.54) (noting that s’(l + (k — I)?-) s sj(l + (fc — l)ri,)) that 

bm»c = L„e'L^^ . 

Furthermore, it will be noted that when Hmvc is true, the ptli moment of Lmvc 
given by (2.27) is equal to the product of the pth moment of L^c given by (2.43) 
and the pth moment of Z/St^ (obtained by replacing g by g{h — 1) in (2.68). 
Thus, when Hmvs is true Xm«c is composed of the product of two independently 
distributed quantities, namely L,c and Z(J^^ 
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CONTRIBUTIONS TO THE THEORY OF SEQUENTIAL 
ANALYSIS, H, HI 

By M. a. Girshick 

Uniied Stales Department of Agriculture 

Summary. This is a continuation of a paper Part I of which was published in 
the June, 1946 issue of the Annals of Mathematical Slalislics. The present paper 
is divided into two parts, Parts II and III, which arc summarized as follows: 

Pari II. The Exact Power Curve and the Dislribuhon of n for Sequential Tests 
Where z Takes on a Finite Number of Integral Values. 

n 

Consider a sequential test defined by a decision function Z„ — ^ Znrvith 

aul 

boundaries —b and a where a and b are positive integers and Za is the oth ob¬ 
servation of a variate z which takes on a finite number of integral values ranging 
from the negative integer — rto the positive integer m with respective probabili¬ 
ties p-r, ■■,Pm. Let faf = PlZn = (u + z)], (t = 1, 2, ,m — 1), and fj, = 
P[Z„ =-(!>+ ^)], 0 = 1, 2, • • ■ , r - 1). Furthermore, let A be a square matrix 
of a 4- h — 1 rows and columns with elements defined by: o„ == 1 — po for all i; 

= -Pkfoi'k = 1, 2, ■ • ■,m;a,,i^j - —p^jforj = 1, 2, • • •, r,and Oj, = 0 
otherwise. 

It is proved that 

(0 i P«-rAr-f—{,6 , Q" — 0, 1, ■ • ■ ,7" 1) 

1~0 

m—7—1 

(^) ~ ) (j “ 

««-0 

where Aij is the element of the Ath row and hth column in A'*. Let Eojt" 
and Eh,r‘^ be the conditional generating function of n under the restriction that 
Zn = (a + j) and Zn = ~ib + j) respectively. Then ^b}Eb,T" is obtained by 
, substituting rp, for each p, occurring in equation (i) and fojEtfj'r’* is obtained by 
substituting rp, for each py occurring in equation (ii). The probability that 
= a -f i in exactly n steps is given by the coefficient of t’' in the expansion of 
fajEoir" in a power series in t. The probability that Z^ — — (h + hi exactly 
n steps is similarly obtained. 

This method is applied to the derivation of the exact power function and the 
distribution of n for the sequential binomial probability ratio test. 

Pari III. On Conjugate Distributions. 

Consider a random variable X with a distribution density fix, 8 ) which satis¬ 
fies certain specified conditions. Let 8 i and 6 ^ be two values of 8 and let z = 
log (fix, d 2 )/fix, 6 i)). For any hypothesis 8 = 8 ', let (pit | 8 ') be the moment 

282 
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generating function of z and /i the non-zero value of t for which ip{t \ 8') = 1. 
We set F{x) =e'%x, 8'). Then / and F are conjugate distributions. If 
F = f{x, 8"), then 8 ' and 6 " are defined as conjugate pairs. 

A method is given for obtaining the totality of conjugate pairs for the general 
class of distributions which admit a sufficient statistic -It is then shown that 
the power of the sequential probability ratio test based on such distributions is 
given explicitly in terms of these pairs. It is proven that within the approxima¬ 
tion obtained by neglecting the excess of \Z„ \ over o and 5 at a decision point 
the following relationship holds: 

P,{n\F) = e-“Pi(n|/) 

P.(.n\F) = e’^Pa(n\f) 

where Pi{n \ g) and Pa(n \ g) stand for the probability that Z„ > a and Zn < —i 
respectively m exactly n steps under the hypothesis g. 

II The Exact Power Curve and the Distribution oe n for Sequential 
Tests Where z Takes on a Finite Number of Integral Values 

2.1, General discussions. Let a sequential test be defined by a decision func- 

n 

tion Zn = ^Za with boundaries — b and o where a and b are positive and 

is the ath observation of a variate z which takes on a finite number of integral 
values, — r, r -|- 1, • • •, — 1, 0,1, 2, • • •, m. Let P (2 = i) = p, where P{z = t) 
stands for the probability that z takes on the value i. We shall assume without 
any loss of generality that a and b are integers. 

When the sequential test terminates with > a, the possible values that 
can take on are; o, a-hl, ■ • a m — 1. Blmilarly, when the sequential 
test terminates with < —b, the possible values which can take on are: 
-b, —Q) + 1), ■ ■ , -{b + r — 1). Let = P[Z„ = (o -f i)], i = 0, 1, • •, 
m — 1, and ft, = P[Z„ = — (b + i)], i = 0,1, • • , r — 1. 

For any variate u, let the/ symbol Euiu) stand for the expected value of u 
under the restriction that Zn = — (b + i), and the symbol Emiu) stand for the 
expected value of u under the restriction that Zn = a + Let <#>(<) be the gen¬ 
erating function of z. Then 

m 

(2.101) 4,(1) = Z P,t'. 

r 

In terms of the generating function, the Fundamental Identity (see section 
2.32 in [6]) can be written as 

r—1 wt—L 

(2.102) E -"’+‘>F6,[^(0J''" H- E = 1. 

t-Q 

It follows from (2.102) that for all values of t for which 

m 

4>(t) = E = 1. 

1=0 —r 


(2.103) 
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r—1 w—1 

(2.104) V'(i) =E + D 1 

i—0 i"^) 

where is the generating function of 2„ . 

In the paper “The cumulative sums of random variables’’ [2] Wald has given 
the following method for obtaining the probabilities foj and fjj. Let ti, h, 

• ■ be the r + m roots of (2.103). Substituting these in (2.104) we get 
r + m linear equations in the r m unknowns, fa< and . Thus, if the deter¬ 
minant of these equations is different from zero, the rxnknowns can be solved 
in terms of the roots of (2.103). In a similar manner, the characteristic function 
of n under the restriction that Zn — i can also be obtained. 

The above method has two disadvantages. First, it involves solving for 
all the roots of a polynomial which will often be of a high dqgree and second, it 
involves solving a set of linear equations with coefficients which are powers of 
complex numbers. 

The method outlined below is in many respects much simpler, It requires 
only the evaluation of one colunan of the mverse of a matrix of o h — 1 rows 
and columns. The elements of the matrix are given explicitly and are either 
0, 1 or pj. This permits obtaining general solutions for special classes of 
sequential tests. 

2.2. Derivation of the exact power functions. We multiply <A(0 — 1 by f 
and \(/{t) — 1 by and obtain two polynomials. 

(2.201) /(<) = i: (p/_, - 6,r)i^ 

l~o 

and 

(2.202) g{i) = 2 -f 2 

J-O ,-0 

where fi,* = 1 when i = k and zero otherwise. 

By the Fundamental Identity, every root of J{t) is also a root of g{t). Since 
f{i) is of degree m + r and g{t) is of degree a + b + m+ r — 2, it must follow 
that g{t) equals f{t) times a polynomial of degree a + b — 2.^ That is, 

a+k-2 

(2.203) g{t) =Sit)'£ cj 

where the c's are undetermined constants. Substituting from (2.201) in (2.203) 
we obtain 

(2.204) git) =2] Qj 

i-o 

^ It is assumed here that j[t) has no multiple roots. The author conjectures that this is 
true for the polynomial under consideration for all Values of p 
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where 

(2.205) Q, = E (p.-r - . 

i-O 

Comparing the coefficients of (2.204) with those of (2.202) and taking into 
aecount the fact that p* = 0 when k > m and c* = 0 when A: > o + 6 — 2, we get 


f-j-i 


(2.206) 

pi—f 3—1—1 J 

t-0 

(;■ = 0,1, • 

,r - 1), 

and 

(2.207) 

m— i—1 

^07 ^ Pi+3+1 2 t 

O 

II 

■ ,m - 1). 


«-o 


Thus, if the c’s (we require only the first r and the last m) are determined, the 
probabilities and fi, are also determined from (2.206) and (2.207). But, if 
we examine the structure of git) in (2.202) we see that the coefficients of all the 
powers of t from r to (a + i + r — 2) mclusive are zero except for the co¬ 
efficient of which is equal to —1 Consequently, if in (2.204) we set 
Qj = — , for all j = r, r 1, • , a -f- 6 -{- r — 2, we shall have the 

required number of equations to solve for the a -f- b — 1 unknown c’s. 

In view of (2.205) these equations can be written as 

7 

(2.208) E ~ P*-f)Cj-i = I 0’ = ^> • • • »a + b + r — 2) 

•-0 

Changing the range of the subscript j, we get 

j+r—1 

(2.209) E (S.r - Pv-r)Cj+,-,-i = 5,6, O' = 1, 2, • • ■ , o + b - 1), 

•-0 

with the understanding that p^ = 0 when h> m and c* = 0 when fc > o -b b — 2. 

Let A be the matrix of the equations in (2.209). Then A is of the following 
form. The elements in the main diagonal are (1 — po). In the diagonals to 
the right of and parallel to the main diagonal, the elements are — p_i, — p-j, • • ■, 
—p_r ,0, •■•,0 successively, m the diagonals to the left of and parallel 
to the main diagonal, the elements are —pi, —pj, • • •, —p™ ,0, ■ ■ - , 0 suc¬ 
cessively. Assume that the determinant of A is different from zero* and let 
A~’ be the inverse of A. Let the elements of be designated by A,-;, (i, j = 
1, 2, • • •, o -f b — 1). Then, in view of (2 209) we get 

(2 210) c, = Aj+i,6, (j = 0, 1, 2, ■■•,o-l-b-2). 

Finally, from (2.206) and (2.207), we have, 

ri-,-1 

(2.211) tb, = E Pi-r Ar-j—1.6, (j = 0,1, 2, * • • , r 1), 

1-0 


* P. L. Hsu has submitted a simple proof to the author that A is non-singular. 
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and 

}—1 

(2.212) ffl; — Ph 24-1 j 0 ■ Ij 2| ‘ , 771 1) 

»"»D 

where, as befoie, it is undersstood that p* = 0 when k > m and Aia, — 0 when, 
fc > a -h 6 — 1. 

From (2.211) and (2 212) we can obtain the probability that Zn < ~b and 
the probability that > a since these are given by 

r-l m—1 / r—1 \ 

2] and 2 {<./(= 1 - 5!^ fij) 

)-.o ,-o \ i-a / 

respectively We can also obtain Sn, the average number of steps required 
to reach a deci.sion For, if we differentiate (2.102) with re.spect to t and 
set / = 1, we get 

m—1 y—1 

+ t) — H f6,(h + i) 

(2.213) £?(n) = ^" = - 

£ iPi 

1—r 

2.3. Derivation of the probability that the sequential test will terminate in 
exactly n steps. Let be the generating function of z and V'(f, 0 the jomt 
generating function of Z„ and n. Then 

(2.301) = £ vJ 

»«*—r 

and 

r—1 m—1 

(2.302) t) = 2^ iu Eti r" + £ f., E^i t\ 

1-0 <-0 

Furthermore, let r) = T<l){t) — 1 and ^i(f, r) = ^(t, t) — 1 In terms of 
these functions, the Fundamental Identity can be stated as follows: For a fixed 
T, every root of r) is also a root of ^i(f, t). I^et f{t, r) = r) and 

git, r) = r). Then 

m+r 

(2.303) Kt, t) = £ (rp,-. - d,r)f 

}-0 

and 

(2.304) git, r) = E (io,F6,'r")r^-‘ - + E ({„,£„• 

1—0 1_0 

Since for a fixed r, every root of fit, r) is a root of git, r), and since fit, r) 
is a polynomial in i of degree m + r and git, r) is a polynomial in f of degree 
a + b + m — 2, it must follow that* 


' See footnote 1, section 2.2 



SEQUENTIAL ANALYSIS 


287 


Q +b-- 2 

(2-305) g{t, t) = f{i, t) 2 di t\ 

1-0 


The rest of the argument is identical with that of section 2.2 except that the 
unknowns in this case are and faAj’" and are given by 


(2.306) 

fbj Et,r’' 

r-j—1 

T'Pi— r dr—;— 1—1 j 

U = 0,1,. 

- -, r - 1), 

and 





(2.307) 

^ajEa,T^ = 

m—J—1 

i—2 » 

_n 

U = 0,1, •. 

■, TO - 1), 


(see (2.206), and (2.207)) where the d's are obtained by solving the linear equa¬ 
tions: 

3+r—1 

(2.308) g (5._, - Tp._) = 6,i, (j = 1, 2, ■ , 0 + 6 - 1), 

(see (2 209)). Thus, we see that the solution for fb.E’t.T" is obtainable from 
the solution given in 2.2 for fs, by substituting rp,- for every p, appearing in the 
expression (2.211). Similarly, the solution for fajEoTr" is obtainable from the 
solution given for |o, by substituting rp, for every p, appearing in the expression 
( 2 . 212 ). 

Let p(Z„ = A: I n) stand for the probability that Z„ = km. exactly n steps and 
let p^.in) = plZ„ = (a + i) In] and p6,(n) = p[Z„ = _(b + i) | „]. Then 
pa,(n) and pt,(n) are given by the coefficient of t" in the expansion of fai£?„T" 
and h,Et,iT” respectively in a power series in t. That the expansions are valid 
can be seen from the following considerations: If we examine the solutions given 
for foiSoiT" {i = 0, 1, • ■ - , TO — 1), and (f = 0, 1, • • - , r — 1), we see 

that each is a ratio of two polynomials in r, the polynomial in the denominator 
is, in each case, the determinant of the Imear equations (2.308). Now, it is easy 
to see that this determinant eqals 1 when t = 0. Hence the expansions are 
valid in a neighborhood of r = 0.'* 

Let Pan = p[Z„ > a 1 7i] and p6„ = p[Z„ < — b [ n]; then 

frt—1 

(2.309) Pan = p„{n) 

»-0 

and 

(2.310) P6n = ^ pw(n). 

t«0 

We have also: 

(2.311) E = E iai = p(z„ > o) 


* It can be seen from (2.303) that for a fixed t) = 0 implies that ip{t) = 1/t. Hence 
if T < 1, <p(j) > 1. Thus, the Fundamental Identity is valid in the neighborhood of t — 0. 
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aad 

to r—I 

(2.312) 2 P6» = 2 &. = P(^" < -b) 

TTlJ 

where mi is the smallest integer greater than or equal to a/m, and mj la the 
smallest integer greater than or equal to b/r. 


2.4. Application of the method to the binomial distribution. We shall 
consider the binomial in terms of acceptance inspection although the results 
are general. 

Let a sequential acceptance inspection plan be defined by pi, pi, a and /9 
where pi is the fraction defective which can be tolerated in the lot, pj is the frac¬ 
tion defective which cannot be tolerated, a is the maximum probability that the 
lot will be rejected when the fraction defective is pi or less and /9 is the maximum 
probability that the lot will be accepted Avhen the faction defective is p- or 
greater. Then the sequential criterion is given by two parallel lines ([1] and [3]). 

(2.401) di = — hi + sn 

(2.402) — hi + sn 
where 


(2.403) 


(2.404) 


hi = 


log 


1 - 

|8 


log 


P2(l - Pi) 
Pl(l - P 2 ) 
1 - ^ 


log 


hi = 


log 


Pijl - Pi) 
Pi(l - Pi) 


(2.405) 


s 


log 


1 — Pi 

1 — P2 


log 


Pi (1 - Pi) 
pi (1 - Pi) 


and n is the number of observations taken sequentially. We assume that 
« + /3 < 1 and pi < pa. Then hi and ht are positive and s lies between 0 and 1. 

The sequential procedure is as follows: Items arc examined one at a time in 
sequence. If at any stage, the cumulative number of defectives found in the 
sample thus far taken is less than or equal to di given by (2,401), the lot is ac¬ 
cepted; if the cumulative number of defectives 13 greater than da given by 

(2.402), the lot is rejected; if neither holds then another observation is taken 
and the process continued. 

It IS easy to show that the sequential test described above is equivalent 
to the followmg: A variatd z takes on the values — s and 1 — s with respective 
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probabilities q and p, A sequential test is defined by the two boundaries ~hi 

n 

and hi and by the decision function Zn = ^Za where Za is the ath observa- 

B—1 

tion on z. The sequential test terminates if and only ii Z„ < — ft.i or Z„ > / 12 . 

As was mentioned above, s lies between 0 and 1.® We shall derive the exact 
power and the distribution of n for this sequential test by assuming that s = 
u/v where u and v are integers and u < v. This restriction is not serious since 
every value of s can be approximated to any degree of accuracy by a rational 
fraction, and, moreover, when the sequential test is applied in practice, s is 
always taken as rational. 

Suppose s = u/v. Then the sequential test is equivalent to a test in which 
the variate 2 takes on the values —u and v — u with probabilities q and p, 
respectively, and the boundaries are given by —hiv and hiv. Let b be the small¬ 
est integer greater than or equal to hiV and a be the smallest integer greater 
than or equal to h 2 V. Then, since u and v are integers, there is no loss in gen¬ 
erality in assuming that the boundaries are —b and a. We shall also assume 
that u and v are prime to each other (i.e, the fraction u/v is reduced to lowest 
terms) so that the interval ( — b, a) is the shortest possible for this test. 

The above discussion shows that a sequential test based on the bmomial 
can be considered as a special case of the class of tests treated in this section. 
Smce 2 takes only on two values, the linear equations (2.209) assume the simple 
form: 

(2.401) —pC,+u-n-i + C— qCj+u-i = 5h,, (j = 1, 2, ■ ■ ■, a -h b — 1) 

where C* = 0 when k is negative or greater than a -1- b — 2. In terms of the 
C’b, the fjj and Ea, are given by 

(2.402) = qCu—]~i , 0, ~ li ' ■' I l)j 

(2.403) ~ D+j —1 t (j “ 0, 1, ■ ' ■ I U u 1) 

The conditional generating functions of n are obtained by solvmg (2.401) 
with rp substituted for p and rq substituted for g. 

Since the first v — u and the last u equations in (2.401) contain only two 
terms and all the other equations contain only three terms, the C’s can be ob- 
tamed without too much difficulty by direct substitution provided a -h b is 
not very large. When a + b is sizeable, a general solution is called for. So far, 
the author has been able to obtam this only for the case = 1. This special 
case also has been considered by Walter Bartky [4]. 

Setting u = 1 in (2.401) we get 

(2.407) -k C^^-qC, - hs, (j = 1, 2, ■ • ■, a + b - 1), 

where C* = 0 when k is negative or greater than o -f- b — 2. 


‘ In fact, it follows from Theorem 1, section 3.2 below that pi < s < p%. 
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Consider a general set of equations of the form (2.407) with the subscripts 
ranging from 1 to an arbitrary integer k. I^et the determinant of these equa¬ 
tions be designated by A* . Then by direct expansion it can be shown that 
A* satisfies the difference equation. 

(2.408) At = At.-i - 
with the initial conditions 

(2.409) A( = 1, i = 1,2, 1; A. = 1 - 

The difference equation (2.408) can be solved by well known methods. We set 

(2.410) <^(1) = 

and then multiply each side of (2.410) by 1 — i + This yields 

(2.411) (1 - a: + pq'-^x-)^(x) = 2 [Ay - Ay-i + Ay-Ja^-^ 


But by (2 408) and (2.409) we find that the right-hand side of (2.411) equals 
1 — pq’~''x'’~^. Therefore, 


(2.412) 


^ix) = 


1 - Pg 3= 

1 — ® -b P2’“‘ x’ 


If we expand (2 412) in a power series in a:, the coefficient of x* will bo At+i, 
This expansion can be performed readily and we get; 


(2.413) At+i ^ E (-1)^C5 -^"“‘\p!Z’’"V - Z (-l)^Cr“^'’“'’’^‘(Pg’“V^' 

J-O l-Q 


where mi stands for the largest integer less than or equal to,k/v mt stands for the 
largest integer less than or equal to ft— i; -f- 1 /v and d =rl/tl(r — t) 1. 

Let us define A® = 1 and A* = 0 when ft < 0. Then, in terms of the extended 
definition of Ai, Cy is given hy 


f2 4141 C, = A/-|,A„y.t_i 

^ ^ ^ g’-*«A.^_i 

for j = 0, 1, • ■ ■, a -t- b — 2. To prove this, we substitute in the left-hand 
member of (2.407) the expression for C* given in (2.414) and get 

(2.4151 ^m->-i(Ay-t — Ay-^^i -|- pq Ay-u-t) AB-.i(AyAy^i -j* pg* ^'Ay-n) 

But in view of (2.408), (2.409) and the extended definition of A*, the expression 
in (2.415) vanishes for all j ^ b. Whenj = b, [the expression equals 1. Hence, 
it follows that (2.414) is the desired solution. 

Let Lp = plZ„ < —6], Then Lp , when plotted against p, gives the operating 
characteristic curve for this sequential test. But Lp = qCa ■ Hence, we have 
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(2 416) 

As a final remark, we wish to point out that the solution to the sequential 
problem presented in this section, where taken in conjunction with Wald’s 
solution, is of mathematical interest, since it relates each element of the inverse 
of a square matrix (designated by A in this section) with the roots of a poly¬ 
nomial/(O given by (2.201). 


III. Conjugate Distributions 

3.1. General discussion. Consider a random variable X with a distribution 
density/(x, 6) Let di and 62 be two specified values of 6 and let 


(3.101) 


2 = log 


f(x, 62 ) 
fix, Oi) ■ 


For any hypothesis d = 6', let <^(f | 6') be the moment generating function 
of 2 . That is, 

(3.102) <i>(t 1 ^ 0 = f e‘7(x, e'} dx. 

J—tC 

Let h be the real non-zero value of t for which (f>(f | 8 ') = l’ and let 

(3.103) Fix) = e^’fix, F). 

Then Fix) is a distribution density. Following Wald [5], we shall call Fix) 
and Six, 6') conjugate distributions. 

The distribution density F(x) depends on 6 ,, 82 , and 8 '. In some instances 
Fix) will be a member of the class of distributions fix, 8 ). This is the case, 
for example, when 2 is a discrete variate. It is the case also if 8 ' = 61 . For 
then h. = 1 and F(x) = fix, 82 ). If F(x) belongs to the class of distributions 
fix, 8), we shall designate Fix) by/(x, 8 ") and call &' and 8" a conjugate pair. 


3.2. Conjugate pairs and the power curve for sequential probability ratio 
tests in which the imderlying distributions admit a sufficient statistic. Let 

fix, 6 ) admit a sufficient statistic and let a sequential test be defined in terms 
of the piobabhity ratio z given by (3.101) for some specified hypothesis di and 
alternative hypothesis 82 with di < 82 . Let the boundaries be given by — b 
and a where a and h are positive. Smce fix, 6 ) admits a sufficient statistic, 
it can be written in the form 

(3.201) fix, 6 ) = 

The probability ratio 2 is then given by the simple expression 

(3.202) z = u(x)[i'(02) — ^(ffi)] + wie 2 ) — widi). 


‘ If X is discreto, then/fx, 6) stands for the probability that X •= x when 9 is true 
^ See section 2.31 and Lemma II, section 2 32 in [6] 
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Let 


(3 203) 

b* - ^ 

ll(02) — l>(0l) 

(3.204) 

^ viBi) — u(0i) 

(3,206) 

_ in(0i) — widi) 

^ «(0i) - tj(0i) ' 

In terms of b*, a* 
lines* 

and s, the sequential criterion is defined by two parallel 

(3.206) 

An ~ —b* + sn 

(3.207) 

= a* + sn 

n 

and the decision functions X) The hypothesis 0 is accepted 

a-l 

n 

whenever ^ io{zu) 

a-l 

n 

< An and rejected whenever u(x«) > I?«, If 

0-1 

n 

An < I2'U'(a:«) < 

Rn , another observation is taken. This process is con- 

tinned until one or the other decision is reached. 

In what follows, we shall restrict ourselves to the general class of functions 
fix, 0) for which the differentiations under the integral sign indicated below 
are permissible and v(0) is a monotonic function of 0. 

Consider the function 

(3.208) 

^(0) = soiB) 4- w(0). 


We shall show that ^{6) = coastaEt has exactly two roots in 6. To this end, 
we prove the following theorems. 

Theorem 1. Let Euix) ( 6e f/ie exjteeUd valw of u{x) under the assumption 
that 6 is true. Then there exists a value of 6 = do such that (a) Eu{x) | flo = *: 
Q>) < 6o < 6% and Eu{x) | < Eu{x) | if v(0) is an increasing func¬ 

tion of 9, and the inequalities are reversed if v{6) is a decreasing function of 9. 

Proof: Assume that v(9) is an increasing function of 0. Let z* == u{x) — s 
and let <t>{t) | 6 be the moment generatmg function of z* under the hypothesis 
that 6 is true. Then, it is easy to see that 4>{h j 0i) = 1 and <^(—h | 0s) = 1 
where h = vidi) — v(9i). Since h is positive, it follows by Lemma 1, section 
2.6 of [6], that Ez* | 01 < 0 and Ez* | 0i > 0. Therefore, Eu{x) j 0i < s and 
Euix) I 02 > s. Moreover, as we shall see in the proof of Theorem 2 below, 
Euix) I 6 is assumed to be a continuous function of 6 and proved to be mono- 

* It is here assumed that > 0. If this is not the case, then o* and b* have 

to be interchanged. 
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tonically increasing. Hence it must follow that there exists & 9 = Bo such that 
Eu{x) I 00 = s and 6 i < 6 o < 62 . This proves the theorem in case v{d) is mon- 
otonically increasing. However, the argument is identically the same in case 
v( 6 ) is monotonically decreasing. 

Theorem 2. Let he defined as in (3 208). Then is a monotonically 
increasing Junction of 6 in the interval 6 < 6 a assumes a maximum at 6 = 60 ; 
and is a monotonically decreasing function of 6 m the interval 6 > 60 . 

Proof. If we differentiate twice the identity 


(3.209) 



gi.(i)ti(«+r(x)-h»(») 


= 1 


with respect to 6 we get 

(3.210) v'(e)Euix) I 6 + w'{e) = 0 
and 

(3.211) v"{e)Eu{x) I 6 + w"(e) = b'(0)]V“(.) 

where <ro(i) is the variance of u(x). Also, if we differentiate under the integral 
sign the function Euix) [ 6 with respect to 6 , we get 


(3.212) 


dEu(x) I 6 


ll'(0)trl(»). 


Now by hypothesis, v{ 6 ) is monotonic in 6 . Hence from (3.212) we see that 
Eu{x) I 6 is also monotonic. Moreover, if v{ 6 ) is an increasing function Of 6 , 
so is Eu{x) I 6 , and conversely. Let us assume that v{ 6 ) increases with 0. 
Then for all < 0o, jBm(x) | 6 < s and for all 6 > Ba, Euix) [ 0 > s. Conse¬ 
quently, we have 

(3.213) ^p'iB) > v'i 6 )Euix) \ B + w'iB) 
for all 6 < da and 

(3.214) ^'{ 6 ) < v'i 6 )Buix) 1 B -f w'( 6 ) 

for all 0 > 60 . But by (3.210) the right-hand side of these inequalities is equal 
to zero for all 6 . Hence ip'iB) > Oior 6 < Ba and yj/'iB) < 0 for > 60 . The 
same argument holds when vid) is a decreasing function of 6 . Now let B = 60 . 
Then by (3.210), we see that V''(^o) = 0. Hence, ^(0) is a maximum eA 6 = 6 a. 
This proves the theorem. 

Let c be any constant < ^(flo) within the domain of ^(0). Then by Theorem 
2, the equation ypiB) = c has two roots in B. Let these roots be designated by 
6 ' and 6 ”. We now prove the following theorem. 

Theorem 3. Let z* and fiit \ 6 ) be defined as above. Then (a) (pit | 0') = 1 
fort = viB") — viB' ); (b) </)(^ 1 6 ") = 1 for t = viB') — viB "); and (c) B' and 9" 
from a conjugate pair with respect to z*. 
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Proof: By definition 

(3.215) <l>(t \e') = [ 

V— 09 

Now let i = vid") — v(0') = h. 
we get 

(3.216) <t>(h I S') = I 

J—qO 


Then, in view' of the fact that 


m 


m, 


In a similar manner, it can be shown that (^(—/i | ^'0 = 1. Moi cover, the same 
argument also shows that f(x, 8") = 6 '). This proves the theorem. 

Turning now to the se,quential test defined by (3.206) and (3.207), we .see that 

n 

it is equivalent to a test with the decision function Zt = 2 Za and the two 

boundaries —h* and a*. Let Le be the probability that the sequential test will 
terminate and Zt < —b* (i.e. the hypothesis 6 i is accepted) when 8 is true. 
Then (neglecting the fact that at a decision point might exceed a* or fall 
short of —hi), Lei and L}<< are given by (see for example (2.406) m [6]). 

(a*+»*)4 _ h’h 

( 3 . 217 ) Lei = 


and 

-A(o*+»*) _ -hi" 

( 3 . 218 ) Le" = = e-^'’'Le' 

6 X 


where h = vid") — v(d'). Thus, we see that the two roots of the equation 
^(8) = c determine two points on the power curve for the sequential test. By 
assigning various values to c we obtain as many pairs of points as desired. 

The above results show that for the class of distributions under consideration, 
the real non-zero roots of (p(t | S) = 1 are obtainable from the roots of \p(d) = 
constant. Since i/{8) is completely defined by the form of the distribution 
fix, 6), the power curve of the sequential test can be obtained without a knowl¬ 
edge of the moment generating function of z*. This might be advantageous 
in some cases. 


3.3. The distribution of n under conjugate hypotheses. Le^ Pt,i7i ( g) stand 
for the probability that a sequential test will terminate with Zn < — 6 in exactly 
n steps when the distribution density of x is g. Let Pain | g) be similarly defined. 
Theorem 1 . If we neglect the excess of Zn over a and —b at a decision point, 

(3.301) Pb(n|F) = e-**A(n|/) 

(3.302) Pa(am = e*»P.(nl/) 

where f and F are conjugate dislnbutions as defined in (3.103) and h is ^non-zero 
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real value of tfor which the characieristic function of z = log/(x, 62 )/ f(x, 8 i) 

underthe hypothesis f, equals 1. 

Proof: Since, by definition, F = e‘Vf it follows that ypit — h\F) = |/) 

where | F) is the characteristic function of z under the hypothesis F. Let 

(3.303) <}>(.t\f) = 

where t is a pure imaginary. Furthermore, let <i(t) and hir) be the roots of 

(3.303) such that lim<i(r) = 0 and lim<ii(T) = h (see [2], page 289). Then 

T-»0 T-»0 

<i(t) — h, and t 2 (T) — h wiU be the corresponding roots of 

(3.304) f,(t\F)^e-\ 

Now by the Fundamental Identity we have 

(3.305) + (1 - = 1 

(3.306) + (1 - = 1 

and 

(3.307) + (1 - = 1 

(3.308) + (1 - = 1 

where L/ = P{Zn < —b |/], Esf stands for the expected value of e’'" under the 
hypothesis / and the restriction Zn < —6; Eaf stands for the expected value of 
e'" under the hypothesis / and the restriction Zn > a; and the symbols Lr , 
Et,F and Enr are similarly defined. 

By comparing equations (3.305) and (3.306) with (3.307) and (3.308) we 
see that 

(3.309) LrEhre^” = e ^^LfEb/e^’' 
and 

(3.310) (1 — Lf^Eara^" = e*“(l — Lf)EafB'’', 

Since the above relationships hold for the characteristic functions of n, they must 
also hold for the distribution of n. This proves the theorem. 

If we set T = 0 in (3.309) and (3.310) we also get 

(3.311) Lf = e'’"’Lf 
and 

(3.312) 1 - = e'“(l - Lf). 

In view of (3.311) and (3.312) we see from (3.309) and (3.310) that 

(3.313) Fb,e" = Ebfe^’' 
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and 

(3.314) Bare”' » JSJn/e". 

From (3.313) and (3.314) we obtain the following rather surpriaing theorem. 

Theokem 4. Except for the approximation indicated in" Theorem 1, the con¬ 
ditional disirihution of n under the restriction that Zn < —h as weU as the restric¬ 
tion that Zn'> a is identical for the two hypotheses F andf. 

The above theorems are of particular interest when F is a member of the 
class of distributions /. In any given sequential test the results of Theorem 1 
can be used to facilitate the computation of the probabilities of making a de¬ 
cision. Furthermore, the results of Theorem 4 show that the conditional dis¬ 
tribution of n throws no light on the parameter 6 involved in the distribution 
of 2 . This follows smee the conditional distribution of n is identical for the con¬ 
jugate pair o' and 8 ", and, in any practical problem, 6 ' and 5" will represent 
opposmg hypotheses. 

We shall now establish exact relationships of the type considered above when 
the variate z takes on a finite number of integral values. 

Let 2 take on the values —r, —r -f 1, • • •, — 1, 0,1, 2, • • •, in with P{z = i) = 
p, . Furthermore, let P, = e*‘p< where h is the real non-zero root of 

(3.315) E p.6" = 1. 

»•—r 

Then the probabilities P< and p, are conjugate. We set e* = u and define 
</)(w I e) to be the generating function of z under the hypothesis p(z = i) = 6 i. 
Then 

(3.316) <tt(u I p) = E 

•»~r 

and 

(3.317) ^(«|P) = E P.«‘ = E Pi(e'-uy 

t—r 

Consider a sequential test defined by two boundaries —b and a and a decision 

n 

function Zn = Let and stand for the probabilities that Z„ = 

a^l 

— (f) + i) and Z„ = a + f respectively under the hypothesis that 0i — P{z — i). 
Furthermore, let Pii,(n | 6 ) and Po,(n 1 6 ) stand for the probabilities that Z„ = 

— (b -f- 1 ) and Z„ = (a d- i) respectively in exactly n steps, under the hypothesis 
6 , = P(z = i). Also, let the symbols and stand for conditional expecta¬ 
tions under the hypothesis 8 , = P(z = i) and under the restriction that Z^ = 

— (b + i) and Z„ = o t respectively. 

Since z takes on a finite number of integral values, the Fundamental Identity 
for the two conjugate hypotheses, p and P can be written as: 
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(3.318) 
and 
(3 319) 




E I p)]-" + T, ^Lu'^^'E^Uiu 1 p)]“" = 1 


r-1 


E <[«(«! p)r 


^- 


s 

t-0 




P)]'” = 1. 


For any real number r let Ui(t), 112(7), ■ ■ , Ur+mir) be the r + m roots of the 
equation: 


(3.320) 


<p(u 1 p) 



1 

r 


Then, in view of (3.317) the corresponding roots of 


(3.321) 


<P(^IP) = L P.«‘ = - 

»—f r 


are given by ui(T)e \ tij(T)e *, •••, Ur+n(T)e * Substituting these roots in 
(3.318) and (3.319) successively, we get 


(3.322) E if. W,(r)"'‘'Pf, r" + E ff. U, (r)" 


‘Ei 


= 1 


and 
(3 323) 


li ^".[«,(r)e-r'‘^‘'Pr.r" + E fr.[«.(r)e-r^’Pr.T* = 1 

«m0 


for j = 1, 2, , r + m. Since the roots u,(t) are assumed to be known, the 

unknowns in (3 322) and (3 323) can be solved in terms of these roots provided 
the determinant of the equations is different from zero. But in section 2, we 
have indirectly shown that for a sufficiently small t, the determinant is dif¬ 
ferent from zero. Thus, assuming that the solution has been obtained we see 
from (3.322) and (3 323) that 

(3.324) if. Pm t" = Pf. r" 

and 


(3 325) 


fr.pr.r" 


e““+’^ff.P 


p 

o* 


n 

r . 


Setting T = 1, we get 

(3.326) if, = 
and 

(3.327) if. = . 

Moreover, if we expand the expressions in (3.324) and (3.325) m a power series 
in T (which by section 2 is permissible), and compare coefficients of t* we get 
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(3.328) Pi,(fl|P) = rt,(ii|?) 


and 

(3.329) P.,(n|P) = e‘^'t(!i|i)). 
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SUFFICIENT STATISTICAL ESTIMATION FUNCTIONS FOR THE 
PARAMETERS OF THE DISTRIBUTION OF MAXIMUM VALUES 

By Bradford F. Kimball 
New York State Department of Public Service 

1. Summary, The problem of estimating from a sample a confidence region 
for the parameters of the distribution of maximum values is treated by setting 
up what are called "statistical estimation functions” suggested by the func¬ 
tional form of the probability distribution of the sample, and finding the moment 
generating function of the probability distribution of these estimation “functions. 
Such an estimate by the method of maximum likelihood is also treated. 

A definition of "suflficiency” is proposed for “statistical estimation functions” 
analogous to that which applies to “statistics.” Also the concept of “stable 
statistical estimation functions” is introduced. 

By means of a numerical illustration, four methods are discussed for setting 
up an approximate confidence interval for the estimated value of x of the uni¬ 
verse of maximum values which corresponds to a given cumulative frequency 
.99, for confidence level .95 Two procedures for solving this problem are 
recommended as practicable 

2. Introduction. If the univeise comprises a set of maximum values of a 
large number of quantities, it has been shown that in many cases the probability 
density function of such a set of values of x is given approximately by 

(2.1) /(x) == ae~*e~‘ t ~ a{x — u), - » < x < -f «, 

where a and u denote parameters, usually unknown [1]. 

This paper is concerned with the problem of estimation of the parameters 
a and u on the basis of sample data. 

The notion of "sufficiency” is fundamental in the problem of estimation, 
since it means that the necessary elements of the sample have been used which 
will result in complete determination of that part of the sample probability 
distribution function involving the unknown parameters to be estimated. 
Unfortunately it does not seem to be possible to set up "sufficient statistics” 
within the usual definition of "statistic” for the above distribution. In this 
investigation the writer was struck by the fact that certain functions of the data 
mvolving one of the parameters could be used to play a very similar role to a 
set of sufficient statistics for determining a and u, in spite of the fact that one 
function involved the value of a, and hence was not directly determined by the 
data,—and hence not a “statistic.” 

Various statistics have been used in the past to estimate the parameters a 
and u, such as the sample mean, variance, mean deviation and an adjusted 
modal value (see [2] and [3]). For the reason noted above, sufficient statistics 
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have not been developed. In order to bridge this impasse and meet the ea- 
sentials of the condition of sufficiency, the writer believes that a broader defini- 
tion of sufficiency is needed. Such a definition is developed in the following 
section. 

3. A broader definition of sufficiency. If the reader reexamines the process 
of estimating the two parameters of the normal distribution, and the deter¬ 
mination of the two parameter confidence region for them from the statistics 
consisting of the sample mean, and the mean square deviation of the sample 
values from their mean, he will find that the separate determination of £ and 
s“ is not inherently necessary. The mean a and the variance or° of the universe, 
are usually estimated from the pair of equations 

E{£) = a, E(s*) = (n - Da-^n 

and the boundary of the confidence region is determined from knowledge of 
the bivariate distribution of x and s, which involves the four variables i, s, 
a, and <r. The equation of the bounding curve is most easily set up in terras 
of transformed variables such as 

(3.1) U = y/n T = \/n s/a. 

Then the probability density of 1/ and Y is given by 

/(I7,y) = (const.)y"'V"'’+''*>'^ 

and with confidence coefficient /9; a. hounding curve may be defined implicitly 
by the two equations 


lff(U,V)dUdV = ^, 
f(Ui,Vt) = constant 

where the above integral is taken over the region of the V ^ 0 half of U,V 
plane bounded by the curve f(Ui ,Vi) = constant. 

A range of estimate of the parameters a and a is offered by this confidence 
region by virtue of the fact that each point of the region corresponds to a unique 
pair of values of a and <r for a given set of sample values On(®i), and the fact 
that the equation of the bounding curve does not involve the parameters o and a. 
Thus one arrives at a determinate range of estimate of a and a, after the sample 
values have been observed. In this paper such functions will be referred to 
as statistical esiimation functions (see [4]). 

The classical idea of sufficiency implies (a) that the estimate be adequate 
for unique determination of the parameters, and (b) that all the sample in¬ 
formation pertinent to such estimation be used. In the case of “statistics" 
the second requirement has been simply and elegantly formulated by the 
requirement that the probability density function of the sample distribution 
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factor in such a way that one factor be completely determined by the statistical 
estimates and the parameters of the distribution, and that the remaming factor 
be independent of the parameters to be estimated (see [7], or [5] p. 135). 

It seems to be possible to carry over this formulation to statistical estimation' 
functions (denoted by Ti). Since one or more of the parameters to be estimated, 
denoted by (oi, oj, - • •, a,), are involved in these functions, a requirement that 
they be adequate for unique determination of these parameters is obviously 
that there be a one-to-one correspondence between the parameter set (ai, 02 , 

• • •, Or) and the set of estimation functions (Ti, 7*2, • ■ T,) in the region of 

estimate. This requirement wiU be referred to as Requirement (1). 

It has been pointed out by a referee that some further requirement as to the 
independence of the probability density function of (Ti, !r 2 , • ■ relative 
to the parameters to be estimated is needed. 

If one requires that the p. d. f. of (Ti , Tj, • • Tr) be entirely independent 
of the parameters (oi, 02 , ■ • Or) the estimation functions ivill furnish “con¬ 
fidence regions” for estimates of the parameters;—see example noted above 
for the normal distribution. 

Hovever, in seme cases the mean values E(T^ may be independent of the 
parameters, while the p. d f. may not be; for example, —estimation functions 
for the two parameters of the Pearson Type III distribution formed from the 
maximum likelihood functions of that distribution. In such cases, a point 
estimation of the parameters is still possible, and would seem to satisfy the 
classical requirements of sufficiency. 

The author accordmgly makes the following proposals: 

(a) Statistical estimation functions that satisfy the first two requirements— 
that of one-to-one correspondence with the parameters to be estimated, and the 
factoribility condition—^be termed sufficient for estimation of the parameters. 
The reasonableness of such a definition is strengthened by the observation 
that given a set of “sufficient statistics” in the classical sense, statistical estima¬ 
tion functions that satisfy the factoribility condition can always be formed from 
them, and hence they are subject further only to Requirement (1) to make 
them sufficient statistical estimation functions under the proposed definition. 

(b) Statistical estimation functions that satisfy Requirement (1) and also 
have a p. d. f. which is independent of the parameters to be estimated shall be 
called stable —a term suggested to the author by a referee. 

(c) Statistical estimation functions T, that satisfy Requirement (1) and are 

such that E(Ti), (i = 1, 2, • ■ r), be independent of the parameters to be esti¬ 

mated, be called stable in mean, and that similarly, if the modal or median 
value's of T, be independent of these parameters, they be called stable in mode, 
stable in median, etc 

Thus a definition of sufficiency applicable to statistical estimation functions 
is formulated as follows: 

The term “statistical estimation function” will be used to denote a function 
of the sample values and one or more population parameters, used for purposes 
of statistical estimation 
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Given a univerBe -with probability density function involving n parameters 
ai, ai, • ■ am in an admissible region R, and a set of r statistical estimation 
fimctions r,( 0 „; fli, 02 , ■ ■ Om) to be used for estimating the r parameters 
<ii, ai, •••, Or relative to the information in a given sample 0 „ . Consider 
the conditions; 

( 1 ) The functional form insures a one-to-one correspondence between 
the points of the r-pararaeter space (ai, Oj, - , Or) contained in R and the points 

of the r-space defined by (Ti, 'A , ■ * A) for fixed 0 „(a:() and fixed parameter 
values flr+i j af ^21 ' *' j am • 

(2a) It shall be possible to express the probability density function of the 
sample On as 

f*(0n) — I A 1 ■ ‘ ■ J Tf J fli J Oa , * * *> Um) ‘^2(0n f Or-t-l j O'r+Z j ' ‘ j am)j 

where the first factor is uniquely determinable for fixed (ai, 04 , ■ ■ ■, a™) from 
the corresponding values of the functions T,, and the second factor is inde¬ 
pendent of the parameters to bo estimated. 

(2b) It shall be possible to express the probability density function of the 
sample 0 „ as 

■P(On) " G^(Ti J Tjj * *j T f f Ui f a 2 i '**1 am)^ 2 ( 0 n ) Ur f -1 J 2 j ' ' ‘ » am)j 

where Q{,, • ■ ■, ; oi, a*, • • •, am) is a functional, depending on ai, a*, ■ • •, o™ , 
which in general involves the values of the A for values of Oj, Oa, • • •, a„ 
different from those appearing in the rest of the identity. (For example, 

G{T, a) = exp f T(0„ ; a')da'.) 

•'0 

(3) The r-variate probability density function of T, based on P(0„ ; ai, 02 , 

• • ■, am) shall exist. 

Definition A. A set of statistical estimation functions T{ which satisfies 
conditions ( 1 ) and ( 2 a) will be said to be a sufiicient set of estimation functions 
for estimating the parameters 0 ,, (t = 1 , 2 , • • •, r), relative to the sample 0 „. 

Definition B. A set of statistical estimation functions T, which satisfies 
conditions ( 1 ) and ( 2 b) will be said to be a 'functionally sufficient set of estima¬ 
tion functions for estimatmg the parameters a; {i = 1 , 2 , ■ ■ r), relative to 

the sample On. 

Dffiniiion C. If the conditions (1) and (3) arc satisfied, and the p.d.f. of 
{Ti, Ti, • ■ Tr) is independent of the parameters Cj, (i = 1 , 2 , • • •, r), the 
functions Ti will be said to be stable relative to estimation of these parameters. 

Definition D. If the conditions (1) and (3) are met, and E(Ti), (i = 1, 2 , 
■ ■ ■, r) are independent of the parameters to be estimated, the functions Ti 
will be said to be slahle-in-mean] and aiuularly if modal or median values of T{ 
are independent of these parameters, the estimation functions will be said 
to be siable-in-mode, stahle-in-median, etc. 



STATISTICAL ESTIMATION FUNCTIONS 


303 


It is not difficult to prove that a set of maximum likelihood functions 
La = 3[log P(0n ; a, ^)]/da, = a[log P(0„ ; a, 0)]/d^ 

under the condition that the second order determinant 


Lau Lafi 

Lfia Lfif 


exists and does not vanish over the admissible range of a and i?, constitutes a 
set of estimation functions for a and /3 that are functimally sufficient and stable- 
in-mean under the definition given above. The meeting of Condition (2b) 
is demonstrated by the relation 


log P(0„ ;oi, ^) = f La(a, Po) dot + f Lfi(a, (3) dfi + log P(0„ ; 

•'ao •^/3o 


ao 


/9o) 


since the first two terms on the right depend entirely upon the functions La 
and Lfi, and the third term on the right becomes independent of a and /S, if 
ao and /So are arbitrarily chosen, once for all, in the admissable region P. 

In general the maximum li k elihood functions are not stable estimation func¬ 
tions, but in many cases by the introduction of suitable factors which appear 
in the variance-covariance matrix (see (5.3) and (5.4)) estimation functions 
may be formed which satisfy Definition C. 


4. Sufficient statistical estimation functions for the distribution of m axim um 
values. The probability density function for the sample 0„(x,) drawn from 
a universe of maximum values is 

(4.1) P(0„) = „n^-Z.-W-)g-a2(x.-«) 

where the summation sign used here and hereinafter refers to summation over 
all indices from 1 to n. Let x denote the sample mean, and define a new set 
of variables z, by 

(4.2) z. = e-“', (i = 1, 2, ■■■,n), 

with mean z. Also set 

zq ^ e 

Recognizing that the variables 2z, /zo are independently distributed like x* 
on two degrees of freedom, the probability density function of z is given by 

(4.3) P(z) dz == [l/r(n)]e~”‘'‘°(nz/zo)''~\ dz/zo 

with mean equal to Zo and variance equal to zo/n. 

The mean value of t of the original distribution (2.'‘.) is known to be Euler’s 
constant, which will be denoted by C. Thus 

(4.4) E[oi{x - ii)] = C = .6772157. 
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The above considerations point to a set of statistical estimation functions 
defined as follows 


(4.5) 


X = s/n [ot(,x — u) — C\, 
y = [l/zo — 1]. 


The author was not able to determine the explicit bivariate probability density 
function of X and Y, but the moment generating function 0 may be found 
with some degree of facility if the variables z,' are used in (4.1). Using sim¬ 
plified functions na{x — u) and nS/zo, 

(4.6) (?(fli, di) = = (1 - e2)”‘'‘“''r"(l - Oi). 

Clearly x and S are not statistically independent The first and second partial 
derivatives give 

G,,(0, 0) = nC, (?,,(0,0) = n, (?,,,.(0, 0) = T»rV6 + n’C*, 

(4.7) 

<?«i«i(0, 0) = n’ + n, Gjj 9 ,( 0 , 0) = n^C — n. 

Hence the variances of the marginal distributions are 

(4.8) a^[na{x — u)] = nirV®) a^inz/zo) = n, 
and the covariance is equal to —n. 

Now the marginal distributions rapidly approach normality with increasing 
n. The question arises whether the bivariate distribution approaches normality. 
One way to prove this is as follows: Consider the moment-generating function 
Gi of the statistical functions X and Y defined by (4.5). Followmg methods 
outlined above, with 83 = = \^n 03 , it is not difficult to show that 

the logarithm of the moment generating function ©2(05 , 04) is given by 

logG2 

= (Vn 03 - w) log (1 - 04/ ^/n) — Vn 04 + nlog r(l — 0s/ Vri) — Vn C. 
As n —> 00 , one notes the relations 


-nlog(l — 04 /Vn) — \/n 04 = 04/2 -|- oi(\/n), 

(4.9) nlogr(l - 0s/Vn) - VnCO, = (05/2)(tV 6) -f- OtiVn), 

■s/n.0alog(l — di/-\/n) = —0204 -f- Os(\/n), 

where o,(\/n) denote functions that approach zero as y/n —^ °o, uniformly 
for 63 and 04 in the neighborhood of zero. The limit 


lim log Ot = §[0j - 20,04 -f (tV6)0}] 


18 recognized as the logarithm of the moment generating function of a normal 
bivariate distribution 
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Thus the Invariate probability distribution function of the estimation functions 
X and Y approaches the normal bivariate distribution with zero means and variance- 
covariance matrix 


(4.10) 


irVO — 1 
-1 1 


as n increases without limit, and the means and second order moments thus indi¬ 
cated, hold precisely for all 'values of n. 

The functions X and Y satisfy Condition (1) for sufficiency relative to estima¬ 
tion of the parameters a and u provided a and u can be expressed as single valued 
functions of X and Y. A condition for this is that the Jacobian of the trans¬ 
formation shall not vanish. This Jacobian may be reduced to 

{{noa)/zm - (2x.e-“‘)/(Se"“*)]. 

Let X, be ordered so that x, ^ . Then for a > 0, the second term consti¬ 

tutes a weighted mean with positive weights which monotonically decrease as t 
increases, when the inequality x, < x.+i holds. Henee unless all x, are equal, 
this weighted mean is less algebraically than x. Condition (2a) for sufficiency 
is clearly met by these functions. Thus one concludes that for a > 0, and the 
case that not all x, are equal, the estimation functions X and Y constitute a sufficient 
set of estimation functions for the parameters a and u of distribution (2.1), Smoe 
the moment generating function (see (4.6)) is independent of a and «, these func¬ 
tions are also stable estimation functions 


6 . Maximum likelihood estimation functions. General theory points to 
the use of the method of maximum likelihood as giving the most efficient solution 
(see [5]) With 

(5.1) /(z) 

the maximum likelihood estimation functions are 


(5.2) 


L„ = —na{z/zo — 1) 

La = n[l/Q; — (x — u) -b d{z/zt,)/da] 


with variance-covariance matrix 

na* n(l — C) 

(5.3) 

n(l - C) (n/a’)kV6 -b (1 - 0*1 

Thus with 


X = i/n {i/zfi — 1], Y = -s/n [«(« — ze ““(le -b ?„/«)) — (ai — l)]/.5 
B = Vt*/6 + (1 - C)«, 
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where 

Za = d['Se'‘^^/n]/da, 

the bivariate distribution of X and Y rapidly approaches normality with in¬ 
creasing n, with zero means, unit variances, and correlation coefficient given 
by (negative, since sign of L„ has been reversed) 

(5.5) r = -(1 - C7)/(N/irV6 + (1 - C)*). 

With non-vanishing Jacobian, X and Y constitute a sufficient set of estimation 
functions for the parameters a and u {see (3.2) above). Furthermore the unit 
variances and correlation value given above are exact for all values of n. By setting 
up the moment generating function it is not difficult to show that these functions 
are also stable estimation functions for all values of n. 

The theory of maximum likelihood further shows that if H and a are defined 
as the u and a so\utions of the equations 

(5.6) Lu = 0, Z/a = 0 

the distribution of \/n (4 — u) and \/n (« — a) will approach normality asymp¬ 
totically with zero means and variance-covariance matrix which is the reciprocal 
of the above matrix (multiplied by n); namely, 

(l/a=)(l -f (1 - (7)V(^V6)] - (1 - 0/(irV6) 

-(1 - C0/(irV6) aV(//6) 

6 . Numerical applications. As an illustration of the application of the 
methods outlined above for determining the parameters of the distribution of 
maximum values from an observed sample, data is taken from the 57 year 
record of annual maximum flood flows previously used as an illustration by the 
author ([6] p. 324). There is some evidence to indicate that such a series 
follows approximately the distribution of maximum values. At any rate the 
series serves pretty well as a numerical illustration. 

Confidence regions for u and a can be determined by four methods based 
upon the preceding theory. In order to make the numerical illustration more 
cogent, we shall answer the following question by each of the methods. What 
is the confidence interval (ivith confidence level .95) for annual flood x correspond¬ 
ing to a cumulated frequency of .99 (often referred to as a 100 yr. flood) based 
upon our observed 67 yr. sample, under the assumption that the distribution 
of maximum values (2.1) applies to this data? 

Method 1. (Based on estimation functions of section 4.) In this case the 
statistical estimation functions Xi and Yi defined from (4.5) by Xi = X \/6 /tt, 
Yi — Y, are used. The “best values” of u and a are taken as the solutions 
of Xi = 0, Fi = 0, found by trial and error. As a starting point values of u 
and a may be estimated from Xi = 0 and the standard deviation of x, (see 
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[2] or [6]), the mean deviation of k, , or an adjusted modal value (see [3]). A 
few trials gives 

H = 179.7, a. = 01998. 

Approximating the distribution function of Xi and Fi by the limiting normal 
bivariate distribution (4.10), with confidence level of .95 the equation of the 
bounding constant probability ellipse is found to be 

(6 1) X\ + (1.5594)Xiyi +Yl = 2.3491 

where the constants are independent of the sample values. This ellipse, by 
virtue of the one-to-one correspondence between (Zi, Fi) and (u, a) bounds 
u and a based upon the observed sample (see [4]). 

For cumulated frequency .99, the distribution of maximum values (2.1) 
yields 

t = aix — «) = 4.60015 

Thus the analytic problem is that of determining the maximum and minimum 
value of 

(6.2) a: = g{u, a) = 4.600l5/« -|- u 
which occurs on the ellipse (6.1).' 

The writer solved this graphically. It was found necessary to compute 
three values of I,—at a — .01, .015 and .025, in addition to the value of z at 
a = .01998 previously found. From these computations the curves a = .01, 
a = 015, a = .01998 and a = .025 were drawn on the chart of the ellipse (6.1). 
The u = const, curves were quite easily determined by points on the a = const, 
curves found from their Xi coordinates which are linear functions of u (see (4.5)). 
The extreme values of a) will be found to occur near the extreme values of a 
on the ellipse. A construction of several « = const, curves near these extremes 
enables one to determme several successive values of g{u, a) at points where 
these curves cross the ellipse. The answers were 

Max. g{u, a) = 507.4 at w == 192, a = .01459, 

(6.3) Mm. g{u, d) = 360.0 at •« = 172, a = .02447, 

and g{% a) = 409.9. 

Method S. (Based on maximum likelihood statistical estimation functions 

(5.4) ). For purposes of comparison the writer carried through the solution 
using the maximum likelihood estimation functions Xj and F 2 defined by (5.4). 

' Since with non-vamshing Jacobian of (Xi, Yi) relative to (a, a), no singular point of 
the (u, a) coordinate system can he within the ellipse, it is clear from the form of the func¬ 
tion a) that its maximum and minimum values will lie on the boundary of the ellipse. 
A similar remark applies to Methods 2-4. 
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In this case the equation of the bounding ellipse was 

(6.4) + (.626l4)X*yj + 7, = 5.4042. 

The determination of the network of a = const., u = const, curves was much 
more complicated in this case. The results were 

Solution of Z = 0, y = 0, gave H = 180.6, a = .01924; g(iX, «) = 419.7 

Max. giu, a) = 509.5 at u = 187, a = .01426 

( 6 - 5 ) . . V 

Min. g{u, a) = 364.4 at « = 172, a = .02391. 

The slightly anoaller range of estimate of g{u, at) resulting from the use of 
the second method was forecast from the general theory which predicts a narrow¬ 
ing of range of variation of u and a for same confidence level. Both bivariate 
distributions involve exact momenta of the first and second degree for finite n, 
and both approach normality rapidly with increasing n. Hence comparable 
results were to be expected. Of course the form of the function g{u, a) in relation 
to the different types of estimation functions used in the two cases might modify 
the comparability of the results. 

Method S. (Based on limiting distribution of maximum likelihood statistics 
a and a, with variances unknown.) The use of the limiting distribution of 
the estimation functions Vn (il — u), Vn (« “ «) led to results which were 
not entirely expected by the author. Taking 

Z, = Aai'd - u)/B, y, = 4(A/a - 1) 

(6.6) A = w ■\/nl\/l, B = -f (1 - Cy, 

with 

r = -(1 - 0/B, 


the equation of the bounding ellipse is the same as (6.4), (no reversal of sign of 
r occurs because sign of r in (6.4) was reversed by reversing sign of in (5.4)). 

Using the inverse method where the range in u and a, ivith 'Cl = 180.6, A = 
.01924, is determined from the range of (Zs, yj) within the ellipse (6.4), the 
maximum and minimum obtained for g{u, a) was 


(6.7) 


Max. giu, a) = 490.2 at m = 193.2, a = ,01549 
Min. giu, a) = 353.8 at u = 174.0, a = .02558. 


This result does not agree closely with the previous results. The reason for 
this discrepancy may be that since the variances indicated by (5.7) are tloI 
exact for finite n, a variation of a from the central value predicted by (5,6) tends 
to exaggerate the departure of the distribution of Z and Y from the limiting 
normal distribution through its effect upon the variances. The plausibility 
of such an explanation is strengthened by the numerical results of a solution 
of our problem by Method 4. 



STATISTICAL ESTIMATION FUNCTIONS 


309 


Method (Based on limiting distribution of maximum likelihood statistics 
•d and a, with variances estimated by taking a = a as observed from the sample.) 
In this case the unknown variances are estimated by taking a = a as observed 
from the sample studied. In order to avoid confusion let oo denote this value 
of CL as used in the variance formulae. Thus the estimating functions and 
Yl become 

( 6 . 8 ) Z 4 = Aao(ii - u)/B, 74 = A(& - a)/ao 

and the approximating distribution of (X 4 , F 4 ) is taken as the same limiting 
normal distribution used in Method 3. With 

Ua = ‘Cl = 180.6, oo = a = .01924 

the extreme values of g{u, d) on the ellipse were 

Max. g{u, d) = 607.4 at « = 188.6, a = .01443 

(6.9) 

Min. giu, d) = 362.8 at « = 169.7, a. = .02382. 

These results agree closely with the results obtained by Methods 1 and 2 . 

The confidence intervals in g{u, a) obtained were, in summary. 

Method 1 360.0 to 607.4 

Method 2 364.4 to 509.6 

Method 3 353.8 to 490.2 

Method 4 362.8 to 507.4. 

From the analysis of the four methods presented above, one might recom¬ 
mend the following two procedures for finding the confidence interval for x 
m a problem of the above description, as practicable: 

Procedure 1. Use Method 1. 

Procedure 2. Determine the maximum likelihood estimates 'd and a from 
(5.6) by trial and error. Then use Method 4. Presumably the second procedure 
would be more open to question, especially for small values of n. 
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ON FtTNCTIONS OF SEQUENCES OF INDEPENDENT CHANCE VECTORS 
WITH APPLICATIONS TO THE PROBLEM OF THE 
“RANDOM WALK” IN k DIMENSIONS 

By D. Blackwell and M. A. Gibshick 

Howard University and U. S. Department of Agriculture 

1. Summary. Consider a sequence (xf) of independent chance vectors in k 

dimensions with identical distributions, and a sequence of mutually exclusive 
events Si, Si, • ■ • , such that Si depends only on the first i vectors and 'SP{S,) 
= 1. Let (/j, be a real or complex function of the first i vectors in the sequence 
satisfying conditions: (1) E(<pi) = 0 and (2) E(,<ps | , • • •, X,) = (pi for j > i. 

Let If) = (pi and n = i when fi, occurs. A general theorem is proved which gives 
the conditions cpi must satisfy such that E<p = 0. This theorem generalizes 
some of the important results, obtained by Wald for fc = 1, A method is also 
given for obtaining the distribution of (p and n in the problem of the “random 
walk” in fc dimensions for the case in which the components of the vector take 
on a finite number of integral values. 

2. A basic theorem. 

2.1 Let {X,) = {(Xi,, Xz,, • Xjb,)} be a sequence of independent fc-dimen- 
sional chance variables with,identical distributions. Let Si, Si, Ss, ■■■, he 
mutually exclusive events such that (1) 3, depends only on Xi, Xz, • • •, X,, 
and (2) SP(Si) = 1. Let ipi(Xi , Xj, • • •, X<) be a sequence of real or complex 
variables satisfjdng the followmg two conditions: 

Condition 1: E(<p,) = 0 for all i. 

Condition 2: E{<pj | Xi, Xz, • • •, X.) = for all j > i, where E{ipj | Xi, Xz, 

• • •, X,) stands for the expected value of <pj under the condition that Xi, Xz, 

■ • ■, X, are held constant.^ Define <p, = <p and n = i when the event Si occurs. 
We shall assume that E{n) is finite. 

A problem of central importance in sequential theory may be formulated as 
follows: What conditions must p, satisfy so that E{(p) exists and equals zero? 
We shall piove the following: 

Theorem 2.1. If there exists a function f{xi, xi, ■ ■ ■, i*) >0 such that (a) 
^[/(Xt)] is finite and (b) (| < J] /(Xd) when n > i, then E(ip) exists and 

d-l 

equals zero. 

Before proceeding to the proof, we consider two consequences of this theorem. 

< 

I. Assume that E{X„) = a,. Let = S (Xr^ — a,). It is easily 

)-i 

verified that ip, satisfies conditions 1 and 2. We set /(xi, Xz, ••■,**) = | av 

* Chance variables ip( satisfying condition 2 have been extensively studied by P. Levy 
[1] and J L Doob [2]. 
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— Or |. Then Theorem 2.1 is applicable and we get E<p = 0. Now <p = Wr — 

n 

nar where Wr = . Hence we have 

««i 


( 2 . 11 ) 


E{Wr) = OrEin). 


The relationship (2 11) has been proved for fc = 1 by Wald [3] and subse¬ 
quently under somewhat more generalized conditions, by one of the authors (4]. 

II. Let < 1 , < 2 , • • ■, ifc be any real or complex numbers for which Ee^'~'^ — a 

is finite and | a ] > 1. We assume that there exists a positive constant M 
such that 


( 2 . 12 ) 

when n > m. Let 

(2.13) 




< M, r = 1, 2, • ■ • , fc, 


<Pi 


= - 1 


so that 


(2.14) ^ - 1 

where Wr is defined as above. It is easy to show that satisfies conditions 
1 and 2. Now, in view of (2.12), when n > i 


(2.15) 


<P* 




where is the real part of /,• and R = e is a fixed positive constant. 

Then, letting 


(2.16) 


,x,) = 1 + 


we may apply Theorem 2.1 and obtain 
(2.17) E{a~”e^'^-^*''"') = 1 

which is a generalization of the Fundamental Identity proved by Wald [5] 
for the case fc = 1. 

Proof of Theorem 2.1. Assume ¥>, is real. Define chance variables Nn 
inductively as follows: No = 0. Assuming No, • • •, Nm defined, define N^+i = 
Nm 4“ , Aljv '„+2 j '' ■)' Also let “ N m ■ Nm—i and = y(N^i^_j.^i) 

+ ■ ■ • + It can be shown by induction that Nm is defined for all m 

with probability one, and that {Hmj, (j/m) are sequences of independent chance 
variables with identical distributions. Clearly tii = n. 

The Strong Law of Large Numbers asserts that if , Z 2 , • • • are independent 

chance variables with identical distribution, then lim — ~b _ 

OT-*00 Til 

c with probability one if and only if Ezi exists and equals c. 
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(2.18) lim = E[f{Xx)] 


It Mows that, with probability one 

/(Z i)+ ••+/(X^) ^ 

m 

and 

lim • ^ lim ^ = E(n). 


(2.19) 


m 


ii->» m 


. j/l + ■ • • + ym _ i/lH- 


Since ^ is a subsequence of 

TJ-l + • ^ * T“ ^ TV m 

we have with probability one, 


/(Xi)+ ■■•+/(X„)' 


m 


( 2 . 20 ) 
so that 
( 2 . 21 ) 


lim = ElfiXt)] 

m-^ao JS m 


lim 


yi+ +y m ^ E[f{Xi)]E{n). 


m 


Consequently, E(yi) exists and equals Ef{Xi)E{n). Since \<p\ < yi, E{ip) 
ejdats, Also using conditions (2) and (b) which were imposed on ¥>, we have 


/ (p dp 

II 

M- 

11 

\ ■lai+"-+a{ 

1 1 y-i ./sy 1 

(2.22) 

= — / ipi dp 

\ ^ fl>» 

= £ 1 IP. dp 

]>( Jsi 


< 2 f i IP. 1 dp < 2 / Vi dp 

3>i •‘Sj d>» •'3/ 


which approaches zero as t -+ =o. This completes the proof. 

If <pj is a complex valued function, Theorem 2.1 still holds. For writing 
<Pi = ihythen Condition 2 becomes E{gj, ihp\Xi, • • •, Xj) = gj + ihf 
when p > j. Hence 

(2.23) E{g,\Xt,>--,Xj) =g, 
and 

(2.24) E(.h,\X^,^-;Xi) = hi 

when p > j. Since | j, | < | | and | Ay j < ] ipy | and vj satisfies condition 

(b) we may apply Theorem 2.1 and get 

(2.26) Eg - E{h) = 0. 

Hence E<p ■» 0. 



INDEPENDENT CHANCE VECTOHS 


313 


3. Applications to the problem of the random walk in k dimensions” 

3.1, A theorem concerning decision 'points^ Let {Xjj = 
be a sequence of fc-dimensional chance vectors with identical distributions. We 
assume that Xji (j = 1, 2, ■■■,&), take on a finite number of integral values 
ranging from — ry to inclusive, where r, and mj are positive integers. We 
remark that any distribution can be approximated to any degree of accuracy 
by the distribution of a variate whose values are integral multiples of a constant 
d, which can be taken as the unit of measurement. 

Let Pu,u,...in be the probability that X» = (ui, us, Ui). We define 

Wpi = ^ Xfj and set fj, = (Wi,, TFj,, Wt,). Then {i7i} represents 

1-1 

a sequence of points with integral coordinates in a fc-dimensional space Sk = 
l(l/i I 2 / 2 , • 2/*'}- Let be an arbitrary bounded region in Sk . We shall 

assume, without loss of generality, that the origin is an mterior point of R. 
We now define a random variable n as the smallest subscript i of the sequence 
( Uj} for which W, is either a boundary point or an exterior point of R. We set 
Un = W = (Wi, Wi, • ■ ■, W*) and designate W as a decision point of R. 
Clearly, the number of decision points is finite. 

The random variables n and W can be interpreted as follows: Consider a 
point Q which at the time t = 0 is at the origin. At successive intervals of 
time t = 1, 2, • • •, the point Q moves with integral components in 8 k the direc¬ 
tion and distance of the motion being determined by chance. The point comes 
to rest as soon as, but not before it either reaches the boundary of R or falls 
outside of R. Let Ut be the co-ordinates of the point Q At time t. Then n 
represents the length of time it takes Q to come to rest, and W represents a 
possible resting point.* 

We shall be concerned with the problem of finding the probability distribution 
of n and W. These will obviously depend on the shape of the region R, In 
what follows we shall restrict ourselves to the class of regions R which have 
the property that the intersection of any line parallel to the axes with f? is an 
open interval. In view of the fact that W has integral coordinates, we can with¬ 
out any loss of generality, replace this class of regions by an equivalent class 
which are bounded by simple polygonal closed surfaces whose vertices have 
integral coordinates and whose sides are parallel to the planes y, = 0. In the 
subsequent discussion we assume that the regions R are of this type. 

Let 

(3.10) l.u.b. [y.] 


• What follows is a generalization of a method previously employed by one of the authors 
[6] for the case 2; » 1. 

' That Q will reach a resting point eventually can be asserted with probability one. 
See A. Wald [6], Lemma 1. 
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and 

(3.11) -6, = g.l b [(y,, 2/2, • • •, 2/0 « 

v< 

then a, and.b, are positive integers. 

We now prove the following: 

Lemma 3.1. For the given sequence of chance vectors {Z<) and the given 
region R, the number of possible decision points Nb is given by 

(3.12) = n (a, + b, + r, + m, - 1) - n (a, + b, - 1). 

Proof: We shall first prove this theorem for a rectangular region R = 
where Ri is defined by — 6, < yt < a,, (i = 1,2, ■ • •, k) and then generalize 
the proof to any region of the class specified. 

Let B2 be a closed rectangular region defined by — {hi + r,' — 1) < y, < 
(o, + m, — 1). Then R 3 > Ri. Let S = Ri — Ri. It is clear that every 
integral point of jS is a possible decision point. Moreover, no point exterior 
to Ri is a possible decision point. For assume, for example, that there exists 
a point W = (Wi , W 2 , • • ■, Wt) which is an exterior point of Ri . Then at 
least one of its coordinates, say W,, has the property that Wj > a, + mj — 1 
OTWj< ~ih, + r, — 1). But amce —{b, — 1) < Wj.n-i < oy — 1, it must 
follow that X}„ took on a value greater than m/ or less than — ry which is con¬ 
trary to assumption. Now the total number of integral points contained in Ri 

k 

is JI (a, -f bj + wiy — 1) and the total number of integral points in Ri 

k 

which by assumption are not decision points, is n (®y + ~ !)• Hence 

the Lemma is proved if B is a rectangular region. 

Now, let R be any polygonal region of the type specified and let Ri be the 
corresponding rectangular region. Consider two randomly moving points Q 
and Qi , each having coordinates Wt at time t. Let the decision points for Q 
be defined in terms of R and the decision points of Qi in terms of Ri, We shall 
prove that the number of decision points for Q and Qi are the same. 

By assumption, every line parallel to the axes intersects i? in an open interval. 
Moreover Ri 2 R. Hence the sum of the areas of the segments which compose 
the boundary of jR must equal the area of the boundary of Ri . The same must 
be true for the total number of integral points on the boundaries of the two 
regions. Thus, the theorem is true for r, = = 1, (j = 1, 2, • • •, fc). We 

assume that the theorem is true for r, = ry and mj = m', and prove that it must 
hold for = mu + 1 for a fixed but arbitrary u. Now it is obvious that if 
the range of is increased by unity in the positive direction, the point Q 
can move an extra unit in the positive direction parallel to the 2/u axes. Thus, 
the total number of additional decision points that Q gams by the unit increase 
in the range of Xu, is identical with the total number that Qi gains. This 
proves the theorem. 
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It is clear that the smallest rectangular region which includes all the decision 
points of W i s . We now prove the following: 

Theorem 3.1. For any polygonal region R of the class previously specified, 
and for any random sequence (X.) in which X, lakes on a finite number of integral 
values, the number of points in ihe rectangular region which are not decision points 

k 

is always equal to n (oj + — 1) where Oj + bj are the dimensions of the 

j-i 

rectangular region Ri. 

Proof; This Theorem follows from Lemma 3.1 and the fact that the total 

h 

number of integral points in Ri is n (ttj -\r bj r^ + mj — 1). 

1-1 

3.2. The distribution of W. Let ^{ti , • -, tjt) be the joint generating function 
of Xui, (u = 1, 2, “ k), and (p{k , • • •, 4) the joint generating function of 

W,(j = 1,2, ■■■, k). Then 

-mh 

(3.21) Wu‘--,tk)= Z ••• E Pk, --tr 

U—rj 

(3.22) <t>{tu ■■•,tk) = Z ••• Z f.. 

til—(ki+n—1) (!>*+ri—1) 

where is the probability that W = (ri, • • •, «*). In terms of the gen¬ 
erating function 4' the Fundamental Identity (3.17) states that 

(3.23) Btf* • • • tPUik , • • • , 4)]“” == 1 

for all <1, • • •, 4 for which [ f(<i, • • •, 4) | > 1- Hence, it follows that for 

h, ■ • ■, 4 for which ^-(<1, • • •, 4) = 1, ¥>(4 , • • •, 4) = 1- Let 

(3.24) /(4 , • • •, 4) = <? ■ • tl" [^(4 , • • , 4) -1] 
and 

(3.25) g{ti, • • , 4) = • • • t**^'‘“* [¥>(4 , ■ • • . 4) -1]. 

Then /(4 , • ■ ■, 4) is a polynomial of degree r^ -j- m^ in ij and g(ti, • • ■, 4) is 

a polynomial of degree (oj + bj + r, -f- — 2) in 4 ■ 

We shall assume that/(4 , - 4) is an irreducible polynomial. Then, since 

g(ti, ■ ■ ■, tk) vanishes for all values of 4 , * • •, 4 for which/(4 , • ■, 4) vanishes, 
it follows* that / is a factor of g. That is 

1 OfcH-bj;—1 

(3.26) g(ti, ■■■ ,ik) = f(ti, ,tk) Z ••• Z C., 

where the C,,....,«» are unknown. Equating coefficients on both sides of 

(3.26) we get 


See, for example, BCoher [7], Theorem 7, Chapter 16. 
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(3.27) 


VI Vk 

£ * ■ ’ (■Pui-rf-«i*-T* 

u imO Ui^o 


SufTy) ('ti-rvu-ri + 
k 

II ^"iiky+ry-l 


■where dij is the Kronecker delta. But by Theorem 3.1, nu (ay 4- by — 1) 
of the f,j.. in , • • ■, /t) are zero since they correspond to values of W 
■which are non-decision points. Hence IIy_i (oy by -- 1) terms in (3.27) 
are zero with the exception of the term &i+ri-i—(corresponding to the 
non-decision point (0, 0)) which is —1. Hence, we have the required number 
of equations to solve for the unknown (7’s and consequently for the f’s provided 
the determinant of the coefficients is diflferent from zero. 

As an illustration, let U — Ri, then the C’s are obtained by solving the set 
of linear equations 


(3.28) 


E 


tk / k 

E(ni 

“*-0 \/-l 


uyry 


Hu,-ry < 


■•*—*■* 


— II ^•/.ky+'-y-i 


where tiy takes on all integral values from rytooy + by-f- r, — 2 inclusive. 

3,3. The disinbution of n. For any random variable U, let stand 

for the expected value of U under the restriction that W = (vi, Vt, • • •, t»*). 
Let ipi (<i, • ■ •, t*; t) be the joint generating function of PTi, TFj, * • •, IF*, 
and n. Then 


(3.31) <Pi(h t) = 

Ui «* 

Let 

(3.32) ••■><*;’■) = 7V'(ti, <i,’••,<*) — 1 

where ,—, f*) is the joint generating function of Zu, • • •, Xki and is given 
by (3.21) and let 


(3'33) f2(<i, ■ • h t) = v»i(<i, •• •, tk t) — 1. 

Then, if we fix t so that | r | < 1, we see by (3.23) that for all values of tk 

for which vanishes, also vanishes. Let 

(3.34) fx{ti ,•••, f* ; t) = {!*•• • fk^{t \, ••■,<*; t) 
and 

\ 

(3.35) /s(fi ,tk\r) = ... 4*'*^*“' , ••• ,tkl r). 

Then for t fixed, fi is a polynomial of degree ry -|- ot, in <y and/i is a polynomial 
of degree o, + by -|- ry -f- my — 2 in <y. Since fi vanishes for all values of 
k, • • •, f* for which f\ vanishes then if /* is irreducible, /* will be a factor of /j, 
That is fi can then be written as 
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oi+I'l—S J 

(3.36) E E 

»i-i m-i 

The rest of the argument is identical with that employed in section 3.3. The 
unknowns in the present case, however, are "When 

is expanded in a power series in t, the coefficient of t” is the probability that 
= (i;i, ■ ■ ■, Di) in exactly m steps. We shall, therefore, examine the validity 
of the expansion of the above function in the neighborhood of t = 0. 

Let us first consider the rectangular region 2E = iRi. In this case the d’s 
are obtained from the equations 

•ID*/* \ * 

(3.37) E .ui-rjijdDl-ri 

Uj-l / ,'-1 

(I'j = »■>■» »■» + 11 ‘ ) Oj + + »■/ ~ 2), 

so that will be given as a ratio of two polynomials in r the 

denominator of which will be the determinant of the coefficient's of (3.37). 
But this determinant equals unity when t = 0. Hence the validity of the 
expansion is established for a rectangular region. 

If R is not a rectangle, the value of the determinant of the equations in d 
will still be unity. This follows from the fact that the number of non-decision 
points in 222 is precisely the same as the number of non-decision points con¬ 
tained in Ri , hence by rearranging of the equations they an be made to assume 
the form (3.37). 
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APPROXIMATION OF THE DISTRIBUTION OF THE PRODUCT OF 
BETA VARIABLES BY A SINGLE BETA VARIABLE 

By John W. Ttjkey and S. S. Wilks 


Princeton University 

1. latroduction. In an article published elsewhere in the present issue of the 
Annals of Mathematical Statistics [1] the ^-th moments of two statistical test 
criteria and L„ were found to have the following expressions, respectively: 

cn tie — TT — 1 — z) + g) ~| r(jn(fc — 1)) _ 

U) i) li r(Kn - 1 - t)) J ranCA: ~ 1) + - D) 


r/: TT 1 - 0 + g) ] - i)(fc - D) 

W 1 ) ILL J r(Kn - 1)(A: - 1) + ( 7 (A: - D) • 

If we denote by («)„ the expression a(o + l)(o + 2) • • • (o + ff — 1) and 
make use of the fact that 


r(ffl + g) = r(a)-(a), 


(4) r(a + rg) = r(a) • (o),, = r(a) • 

where r is a positive integer, the two moments (1) and (2) reduce to 






respectively. 

For any given value of i (i = 1, 2, • • • , A: — 1) the ratio 

/n , i — l\ /n — 1 . i — k\ 

1,2 V 2 k-l)t 

may be expressed in the form 

r(p> + g) 

r(pi + «* + g) 

which is the g-th moment of a beta variable Ui distributed according to 

r(p. + gO — uY^~^du 

r(pdr(g.) 
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Each of the moments in (5) is therefore of the form 

n r(p, + g) 

r(p. + g. + ^) • 

Thus, Lmvc and Ltc are each distributed like th? product of A: — 1 independent 
beta variables. 

Each of the moments in (5) can be expressed in the general form 


Ma = 


Ml 


A, + 1 


B, + 1 


where x = - for —V A, and B, are real numbers. 
n\ n — 1/ 

Other likelihood ratio statistical test criteria which have been discussed in the 
literature have moments which can be expressed in the general form (6). For 
example, the likelihood ratio criterion Li for testing the homogeneity of sample 
variances [2, Neyman and Pearson 1931] has moments of this type. The gen¬ 
eralized Li criterion for samples from a normal multivariate population [3, 
Wilks 1933] has such moments. The criterion for testing sphericity [4, Mauchly 
1940] of a normal multivariate distribution has moments of this kind. All test 
criteria having this type of moment lie on the interval (0, 1). The exact dis¬ 
tribution functions of the criteria, except possibly for r = 1 or 2 in some cases, 
are very complicated. 

The purpose of this note is to consider a method of finding a fractional power 
of the test criterion which is approximately distributed (in a sense to be described 
later) according to an incomplete beta (Pearson Type I) distribution function, 

and to find the appropriate values of p, q, and the exponent of the criterion. 

2. Generalized hypergeometric series as moment generating functions. 

Suppose L is a statistical test criterion, or more generally a random variable 
having as its g-th moment the expression (6). The moment generating function 
<p{t) of L can be expressed as 

liQ-A. + i) 

( 8 ) 

This can be written as 


v(i) = r'+l Fr' 


1, - — Ai, • • •, - — A,'; t 

X X 

- - Bi, - -B,. 

X X 
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where r'+i^’r' [ ] is a generalized hypergeometric series [5, Bailey 1935], We 
shall not make explicit use of this fact; instead, we shall work with the coefficient 
of in the series, i.e.. Mg. 

Let UQ consider 

(10) hiMg = iilnfi - A* + l) - Elnfi - + 1V 

\C /j i-l \X fg 

To expand this in a power series in % consider a single term 

'“(s “ ^ +'). ■ S'°(i “ ^ 

(U) 

= -j?lna: + pin (1 - Aa;) + Z)lnfl + —-V 

j-i \ 1 — Ax) 

Now 


1 + 


- 


\ — Ax 


= 1 + + i-da: + yd a: + 


Writing 

5m(p) = 

and using the usual expansion for In (1 + a;), we find 

In^^ - d + l) = -plna: + - dp]x + [^d' + dSi(p) - iS 2 (p)]x” 

+ [-U" + d*« - AS^ig) + lS,(p)]a;* + 
Applying this expansion to the separate terms in (10) and writing 

(12) c^=Za7-J:b7 

>-i 1-1 

the terms not involving d, or cancel out leaving 
In Jlf, = (-Cig)x + + CM)W 

+ l-\C, -1- C,S^{g) - (7iSj(p)]x’ + ■ ■ . 
We shall return to this expression later. 

3. Powers of a beta variable. If u has (7) as its distribution function, then 

(14) B(m*) = 7-^^ . 

If r = u', T integral, then its p-th moment is given by setting A = rp in (14), 
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We have 


Eiv") 


(P)r 


(P + S)r, ' 


But 


so that 


(15) 


E(v^) = -r^ - - -^ 


r /e 

which is a special case of (6) when p is of order n. 

Putting - = ^ X, = 1 ^ (g _ t 1)/^^ 5, = 1 _ (t _ 1)/;. 

X r 


we have 


Ci = g, 


ft. i- + ,(i + j). 


For any given moment of the form (6), from which .t, Ci, and Ci can be com¬ 
puted, we determine p, g, and r so as to satisfy 


(16) 


P + Q ^ 1 
r X 


g = Cl 

and to satisfy, as nearly as possible, (ivith r integral) 


(17) 


i.e., 


7+9(1 + ;) = <^2, 


r = 


g(g + 1 ) 

C 2 -q 


The use of fractional r is obviously suggested, but its value and validity are 
not discussed here, Using the values of p, q and r thus obtained, the distribution 
of the criterion L (having moments (7)), is given approximately by 


(18) 


CVD^'O - VL)*-‘<i(VL) 
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where the approximation is such that all moments are correct through terms of 

order (—;— 1 (when moments are expanded in series of —;— ) and nearly 

\p + qj P + q 

(exactly if there is an integral value of r satisfying (17)) correct through terms of 



4. Examples. Returning to the g-th moment of given by the first ex¬ 
pression in (5) we have 

a:=-, r' — k — 1 
n 

. k + S- i p k- i 
- 2 -’ 

Cl = + 3A; - 6) 

1-1 *-1 

c, = - Sr! = + 2)(A + 3)(2^ -f 6) - 84] - . 

To determine p, q and r for the fitted distribution of Lmve we set 

P 4* ? _ « 

'~r 2 

^ = J(fc* + 3* - 6) 


„ ^ g(g + 1) 

Cs - q 

and solve for p, q and r. We have the following table of values, p, q and r for 
various values of k {p being calculated by using the roimded values of r): 
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Thus, by rounding r off to the nearest integer and using this rounded value of r 
in determining p, we have values of p, q and t for each value of k, which, when 
substituted in (18) give us the desired fitted beta distribution for • For 
k = 3, the fitted distribution is the exact distribution. 

For the g-th moment of L^e which is given by the second expression in (5), 

2 

it is convenient to expand in powers of-r. Hence we have 

Th ““ J. 



A. = ^ ^ ~ ^ B = —' 

2 ’ k-I 

Cl = i(k' + k-i) 

Cj = ^[(k + l)(k + 2)(2k + 3) - 30] - . 

To determine p, q and r for fitting the distribution function of L^c we put 

p + g _ n — 1 
— ^ 

q=i{k^ + k- 4) 

_ ^ g(g + 1) 

C2 - g ■ 


We have the following table of values of p, q and r for several values of k: 


(rounded) 



2 

2 


2 

2 88 

3 


4 

3.71 

4 


6.5 

4.52 

5 

2.5n - 12 

9.5 

5.32 

5 

2.5n - 15.5 

13 

6.14 

6 

3n ~ 20 

17 

6.88 

7 

3.5n — 25 

21.5 

7.82 

8 

4ti - 30.5 

26.5 

15.26 

15 

7.6n - 111.5 

104 


By rounding r off to the nearest integer, and using the rounded value of r in 
determining p, we have values of p, q and r for each value of k which, when sub¬ 
stituted in (18), give us the desired fitted beta distribution for L„e. For k = 3, 
the fitted distribution is the exact distribution 
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For a given value of k, approximate 5% and 1% points of -{/hnw &nd 
can therefore be obtained from Thompson’s [6] tables of the Incomplete Beta 
Function by entering the tables wit h Pi = 2g, and rj = 2p. For example, for 
fc = 6 the 5% and 1% points of are obtained by entering Thompson’s 

tables with n = 24, and vj = 5n — 24. 
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SOME FUNDAMENTAL CURVES FOR THE 
SOLUTION OF SAMPLING PROBLEMS 

By Edward C. Molina 
East Orange, N. J. 

1. Summary. In using collateral information in an inverse probability situa¬ 
tion to estimate a population fraction from a sample fraction it is necessary to 
use some particular form for the a priori probability function. This paper points 
out the advantages of using Kx^(l — x)‘ for this purpose. The application 
then involves only the Incomplete Beta Function. 

Graphs of the 10, 25, 50, 75 and 90 per cent points of the Incomplete Beta 
Function are given. They cover a range which includes and extends previous 
tabulations. 

2. Introduction. The engmeer, scientist or mdustrialist is often confronted 
with the following "sampling” problem; 

"The probability, p, of an event happening in a siugle trial is constant from 

trial to trial, but the numerical value of this constant is unknown. A series 

of n trials is made and the event happens c times, c < n. What light does 

this statistical data shed on the unknown value of p?” 

As a concrete, example, suppose that a new type of brakes is proposed for a 
given class of steam locomotives making the run from Buffalo to Detroit ^ 
Let each of 30 locomotives be equipped with a set of the new brakes and given a 
trial run Of these, 26 make satisfactory runs, so far as the behavior of the 
brakes is concerned; the remaining four encounter difficulties. Here, the event 
of interest is a satisfactory run, n = 30 and c = 26. What "weight” (confi- 
dence^) may the design engineer assign to the assumption that, say, 25/30 < 
p < 27/30? 

Practical decisions mvolving such, statistical data are usually based on a com¬ 
bination of the data with “collateral” information. In fact, the applied statis¬ 
tician is all too familiar with the extreme case where the statistical data are so 
meagre as to provide no information and where a decision must be made now — 
in these cases the decision is made solely on the basis of the collateral informa¬ 
tion, and rightly so 

The methods of statistical analysis and presentation developed up to the pres¬ 
ent have concentrated on the other extreme case, where the statistical data are 
so good that collateral information can be neglected. 

1 This fictitious example convicts the writer of total ignorance of railroad engineering 
Nevertheless, the illustration brings out, in concrete terms, the class of sampling problems 
under consideration 

* The purely intuitive meaning to be attached to "weight” and “confidence” is the same 
However, the curves presented with this paper are not based on the theory which underlies 
what are known, in statistical literature, as “confidence intervals”. 
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There is a real need for methods of analysis and presentation to be used where 
both the statistical data and the,collateral information should be used. How¬ 
ever, when the significance of the collateral information is adequately expressed 
by a function w(a:), x being a permissible value of the unknown p, the classic 
Bayes-Laplace theory (see [1]) of inverse probability gives the solution to a 
sampling problem. 

The purpose of this paper is to present a set of sampling curves based on a 
V)(x) function whose form embodies some important properties.’ 

3. Hardy’s collateral frequency function- Consider again the locomotive 
brakes problem. The new design may have been carefully engineered, in ac¬ 
cordance with well-known principles, to reduce costs at the expense of a slight 
reduction in reliability of operation. In such a situation, the collateral informa¬ 
tion would be somewhat as follows; There is a high "probability” that the un¬ 
known value of p is a little below the known value for the old type of brakes. 
Moreover, it may be assumed that the "probability” drops rapidly for values 
of p departing materially from this old value. Suppose the latter is p = .96; 
then the collateral information would be presented by some such curve as num¬ 
ber 5 m Figure 1, the mode (peak) of this curve being at .90, which is slightly 
below the old .95 value. 

Number 6, of Figure 1, belongs to the family of curves corresponding to the 
frequency function 


’w{x) — Kx'(l — x)' 

This form for w(x) was suggested, in 1889, by the British actuary Sir George 
F. Hardy (see [2]) for the construction of mortality tables. Its mode, mean 
and variance are given by the equations 

Mode = r/(r + a) 

Mean = (r + l)/(r -f- a -h 2) 

Variance = (r + l)(s + l)/(r + s -f 2)’(r -f b 3) 

G. J. Lidstone (see [3]) has pointed out that the Hardy form for w(x') has two 
important advantages: First—“By suitable choice of r and s any required values 
of the mode or mean and the variance of z, can be reproduced, and thus a great 
variety of distributions may be approximately represented.” Lidstone’s 
z* is our w(x). Second—"The factors x' and (1 — i)* unite in the simplest 
and most elegant way with similar factors in the Laplacian integrand . . . ”. 


’ Many statiatioiana, laoludmg a referee of thia paper, feel that it ie a common aituation. 
to have the collateral information ao vague and eluaive that it ia virtually impoaaible to 
take it into account via inverse probability. (The author doubts this.) Such statisticians 
may wish to use the Clopper-Pearaon confidence intervale, using no collateral information, 
in which case these curves can be used as indicated by Schefi4 (“Note on the use of the 
tables of percentage points of the incomplete beta function to calculate small sample 
confidence intervals for a binomial p”, Biometrika, August, 1944) 
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From this second advantage there follows a third which will be presented in 
section 6 below. 

4. Theory. The Bayes-Laplacian formula gives us 

(1) Pip ^ X) = f\ix)x°(l - xy~‘dx j j\ix)x‘il - x)’'~‘dx 
for the “a posteriori probability” that p < X. In this formula, the product 

Fiq. 1 

Particular forms of ihe a priori {collateral information) function: 



Curve 

r 

X 

Form 

1 

0 

0 

K 

2 

i 

i 

Kxi{l - x)i 

3 

1 

1 

Kx{l — x) 

4 

2 

1 

Kx*(l — x) 

5 

9 

1 

Kx>{l - x) 


x‘(l x)’‘~‘ takes care of the fact that the event happened c times in the n 
trials; the factor w(x) represents, quantitatively, the collateral information. 
Adopting, now, Hardy’s frequency function, we assume that 

(2) wix) = Kx’^il — i)*. 
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r and s being assigned values in accordance with the collateral information 
pertaining to the particular problem under consideration. Theoretically, the 
constant K should be such that 

I w(x) dx = 1, 

Jo 

but, since w(x) enters in both numerator and denominator of (1), any desirable 
value may be given to K. Advantage has been taken of this in constructing 
Figure 1; to facilitate comparison of the five curves shown therein, for each 
curve K is such that the maximum ordinate is equal to 1. 

The second advantage, pointed out by Lidstone, of the form adopted in this 
paper for the function w(x) becomes apparent immediately on substitution of 

(2) in (1). We obtain 

(3) P(p <X) = x^'a - 3?)'""° ^/1^ *‘'(1 “ ^ 

with C = c + r and N = n + r s. Therefore, a siTigle family of fundamental 
curves, plotted with reference to C and N, will give the solutions for a multitude 
of different practical problems. To solve a particular problem, for which the 
values of n, c, r and s are specified, we merely enter the curves with C = c -f r 
and N = n + These linear relations transform all a posteriori curves, 

published on the assumption that w{x) is a constant, into fundamental curves; 
namely, that they are applicable with the more general form (2). For example: 
The information given on the sheets of inverse curves (inserted in the back cover 
pocket) of Col. Leslie E. Simon’s Engineer’s Manual of Statistical Methods in¬ 
cludes the restriction “that prior to sampling, one lot fraction defective is as 
likely as another”. It is now obvious that the use of Col. Simon’s curves is 
not so limited; his curves may be used in any situation wherein the available 
collateral information is covered by the assumption that v){x) has the Hardy 
form. Likewise, the “Weight = ,98” and “Weight = .8” curves (“confidence”, 
in the intuitive sense), presented by R, P Crowell and the writer in their paper 
now have a much wider range of applicability. 


6, Curves. The ratio of definite integrals in equation (3) is tabulated, in a 
different notation, in “Tables of the Incomplete Beta Functions”, edited by 
Karl Pearson. 


This paper 

C 

N -C 
X 

Pip < X) 


Pearson Tables 
p - 1 
2 - 1 
X 

tabulated value 


ThompBon Tables (see [6]) 
(Vi - 2)/2 
ivi - 2)/2 
tabulated value 
caption to Table 


The range of values of C and (N — C) covered by the Pearson Tables is indi¬ 
cated by the shaded area in Figure 7. For curve points falling outside this 
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range (except for C = 1 and 2, found from the binomial summation by trial 
and error) recourse was had to a senes developed by the writer for the solution 
of some problems confronting him, as Svvitchmg Theory Engmeer, in the Bell 
Telephone Laboratories Many points of the C = 1,2, 3, 4, 5, 6, 7, 8, 9, 10, 12 



1 2 4 6 8 10 20 40 60 80 100 

s —► 


PiQ. 2 

and 14 curves can be obtamed directly from the Thompson Tables. They do 
not, however, give any points for the C = 16,18, 20, 25, 30, 40, 45 and 50 curves. 
It may be added that, except for certain marginal values, the Thompson Tables 
were also derived from the Pearson Tables 
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Kp-pj) * *75 
o 



I—«- 


Fio. 3 


Five sets of fundamental curves are submitted, namely, 


Figure 2, 
“ 3, 

“ 4 , 

“ 5 , 


P(p <X) ^ .26, Z = pi 

" = .75, X = p2 

" = .10, Z = Pi 

" = .90, X = Pi 


“ 6, “ = .50, X = po 

It 'Will be noted that pi has been written instead of X for the curves such that 
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P {p <X) is less than .50; likewise, pj for X for those corresponding to P {p<X) 
greater than 50; pa for X for the P {p < X) = 50 curves. 



1 —- 


Pio. 4 

For each pair of values of C and N, the curves of Figures 2 and 3 give the range 

P{p\ < p < ps) = .50 

whereas, the curves of Figures 4 and 5 give the range 

P{Pi < P < P 2 ) = .80 

As an example of the applicability of the fundamental curves, let us reconsider 
the locomotive problem for which n = 30 and c = 26 It was suggested that 
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■ .90 



Fia. 6 


the T 9, s 1 curve of Figure 1 might well represent the collateral information 
available. Therefore we take A^ = 30 + 9 + l = 40 and C = 26 + 9 = 36. 
Entering Figures 2, 3, 4 and 5 with this data we find 


Fig. 

Pip < pi) 

Pi 

[ 

Fig, 

P(.P ^ Pi) j 

Pi 

2 

.25 

.83 


3 

.75 

.89 

4 

.10 

.79 

1 


5 

.90 

.92 
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B -► 


Fig. 6 

Thus we have, for the unknown probability of a successful run with a new set 
of brakes, 

. 83 < p < . 89, with weight . 60 

and 

.79 < p < .92, with weight .80 

6. Sequential property of the curves. The original draft of this paper was 
submitted to Dr. W. V. Houston* in connection with the solution of a problem 

* Of the California Institute of Technology and now President of Rice Institute, Hous¬ 
ton, Texas. It was Dr. Houston who gave the impetus to the publication of this paper. 
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“Tables of The Incomplete Beta-Function,” edited by Karl Pearson, can be used for 
evaluation of 


f a:®(l — ° dx 

Jo_ 

f a®(l — x)^~^ dx 
Jo 


only when values of (N — C) and C are in 



Pig. 7 


in which he was interested. Regarding equation (3), Dr. Houston made a very 
significant comment, the burden of which may be stated as follows: Suppose 
that before the series of ?i trials had been made, it was known that, at some 
earlier time, a series of r -f 5 trials had resulted in r successful outcomes. Sup¬ 
pose, moreover, that the collateral information called for the assumption that, 
a priori, all values of p were equally likely. Under these circumstances equation 
(3), derived by substitution of (2) in (1), gives P{p < X) for two consecutive 
series of trials, one of r + s with r successes followed by another of n with c 
successes. An immediate generalization of Dr. Houston’s thought shows that 
the fundamental curves may be entered with 
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JV = ni + rii + • ■ • + ‘ + r + s, 

C = Cl + Ca + • • • + c< + • • • + Cm + r, 

for the solution of a problem involving m consecutive senes of trials, n, and c, 
being the number of trials and successes, respectively, in the ith series; the in¬ 
troduction of r and 5 removing the restriction that all values of p were a priori 
equally likely. 
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ENLARGEMENT METHODS FOR COMPUTING THE INVERSE MATRIX 

By Louih Guttma?) 

Cornell Umversity 

1. Summary. The enlargement principle provides techniques for inverting 
any nonsingular matrix by building the inverse upon the inverses of successively 
larger submatrices. The computing routines are relatively easily learned since 
they are repetitive. Three different enlargement routines are outlined: first- 
order, second-order, and geometric. None of the procedures requires much more 
work than is involved in squaring the matrix. 

2. Introduction. A set of methods is presented here for computing the in¬ 
verse matrix, based on what we shall call an enlargement principle. The princi¬ 
ple is to build the inverse upon the inverses of successively larger submatrices. 
This leads to simple repetitive routines that are not unlike iterative steps, but 
afford a direct solution. 

The basis for such routines has also been noticed before,' but does not seem to 
have attracted the attention it merits. A possible reason for this lack of atten¬ 
tion may be the belief that the methods apply only to a restricted class of mat¬ 
rices. We establish a simple lemma in this paper which shows that the enlarge¬ 
ment methods apply to all nonsingular matrices, so that their use is perfectly 
general. 

The enlargement principle may be considered an opposite of the “condensa¬ 
tion” principle that governs Gauss’ method of elimination and its variants such 
as the Doolittle procedure and Aitken’s “pivotal condensation.”® It is interest¬ 
ing that the same formula upon which the enlargement methods are based can 
also serve as a foundation for the condensation methods, as is shown in section 
7 below. 

The enlargement methods have the following characteristics: 

(1) The first-order procedure outlmed in the next section has been learned 
by statistical clerks in about ten minutes. People w'ho calculate inverses only 
occasionally and forget the process between times should find the method as 
economical as those who must constantly compute inverses. 

(2) They are direct methods, and yield an exact answer with not much more 
work than is involved in squaring the matrix. 

(3) They can be adapted to electric punch-card systems, which will be effi¬ 
cient when very large matrices are to be inverted. 

* It has appeared earlier in [2]. Waugh’s recent note [10] also rediscovers the basic for¬ 
mula although only a specialized use is suggested there. Professor Harold Hotelling has 
called my attention to reference [1], which overlaps substantially with the present paper, 
and to a use of an enlargement approach to computing latent roots and vectors [9]. I am 
also indebted to Professor Hotelling for other helpful comments on the present paper. 

* For an excellent summary and bibliography of direct and iterative methods for com¬ 
puting the inverse matrix sea ([5], [6]). 
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(4) A sequence of inverses is yielded. Exact inverses of successively larger 
submatrices are computed in the routines, and these inverses are often them¬ 
selves of interest. For correlation problems, this means that a sequence of sets 
of successively higher order multiple correlation constants is produced routinely. 

(5) The general formula upon which the methods are based allows many varia¬ 
tions in procedure, so that special adaptations can be easily made for special 
matrices 

A “first-order” enlargement procedure for computing the inverse matrix will 
be outlined in the next section. The proof for the method follows from the gen¬ 
eral formula in section 4. This procedure and formula are also described in 
[2]. Other enlargement routines are described in subsequent sections. Some 
additional formulas of relevance are discussed in section 8. 

3. First-order enlargement. Let the matrix whose mverse is desired be 

flu flu ■ • • fllii 
flai fl22 • * • fljii 


II fliil OnS ’ * • 0«n 

The following sequence of successively larger principal submatrices will be as¬ 
sumed to be nonsingular: 




flu 

On 

flit 

flu 

<Il2 

A.3 — 021 




f 

On 

023 

flai 

022 

Oji 

On 

083 


If necessary, the rows and columns of A„ can always be shifted to obtain such a 
sequence. The following additional notation will be used: 

Bt = (ai,i+iflj,i+x *•* flj.i+i) 

C, = (fli+i.ifl.+i.a — fl<+i.i) 
d, = o,+i,{+i. 

Thus, we can write 

At Bt 

Ai+i = , , (i = 2, 3, — 1) . 

Ci dt 

The first-order enlargement procedure is to compute m turn ■ • •, 

The inverse of As is computed by the traditional steps: 

(1) Compute A = anaja — flaiflia, and compute 1/A 
[2} Then 

A —A ^fli2 

“■A '<121 ^ ^11 





338 


LOUIS GUTTMAN 


Remember that 5 2 = (niactza)) Ci = (oai ^132), aud that dj = 033 The steps 
for computing are as follows: 

{3} Compute E 2 = . 

(4) Compute/a = . 

{5] Compute I// 2 . 

(6) Compute Ga = , and compute Hi = fi^CiAi''. 

(7} To each element m add the product of the corresponding elements 
in El and to form Ki = Aj^ + EiEi. 

Then the third order inverse is 


Ai 


1 


Ki ~Gi 

-Hi l/fi 


In general, to obtain A 7+1 from A7^, (i = 2, 3, • * , n — 1), imitate* steps 
(3) through {7}: 

{3'] Compute E'i = AT^Bt. 

(4') Compute/, = d{ — CiE[, 

{5'( Compute fT^ 

(6') Compute G\ = fT^E] , and compute 77, = /7^C,A7^ 

{7'J Compute K, = A~^ + £<77,. Then 


A 


-1 

«+i 


Tc. -g; 

-Hi 1/A 


By repeated applications of steps {3'j through {7') to the successively larger 
A7^, A'^ is attained. 

If An is symmetric, then almost half the work is saved, for then Bi = Ci, 
G, = H,, and 77, is symmetric, (z = 2, 3, • • • , n — 1). 

To help gauge the amount of work needed to arrive at A7^, let us compare it 
with the work that would be needed to square A„. For the general asymmetric 
case, v} product sums of n terms each are required for An , a total of n multipli¬ 
cations. With calculating machines, the sums of the products are accumulated, 
BO that no separate process of addition is mvolved. To reach A7^ by the above 
enlargement method, n* — n multiplications are required. Most of the addition 
is accomplished in the process by accumulative multiplication, but an additional 
n(n — l)(2n — 1) 


6 


-h n — 3 terms have to be added otherwise. Furthermore, 


n — 1 reciprocal numbers are needed. Thus, An’' involves somewhat less multi¬ 
plications than does An, but needs more additions, as well as some reciprocal 
numbers. 


• Actually, these steps could be used immediately in place of steps (1} and [2) to com¬ 
pute A r', by lettmg i = 1, and letting A, = On (which may be assumed different from zero). 
The traditional method, however, is quicker for the 2x2 matrix. 
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In linear multiple correlation problems, if A^+i is the correlation matrix of the 
first I + 1 variates, then consists of the regression coefficients of the first i 
variates for predicting the (f + l)th variate, and fi is the square of the multiple 
correlation coefficient for this regression. 


4. A lemma and the general formula. The enlargement procedure just out¬ 
lined is one of many possible routines which can be developed from a general 
formula for the inverse matrix in partitioned form, This formula seems to have 
appeared first in [2], where it is stated that the method applies only to the cases 
where/, 0 in step {4} We shall establish here a lemma that shows that this 
is no restriction, for the submatrix in step (4} is always nonsingular Our lemma 
proves that the enlargement methods will invert any nonsingular matrix. 

Let A„ be a nonsmgular matrix of order n, partitioned in the form 


( 1 ) 


A 

B' 

C 

D 


where A is of order m, (1 < m < n), and will be assumed nonsingular B and 
C are of n — wi rows and m columns, and D is of order n — m. 

The following lemma is needed to show that enlargement methods will invert 
any nonsingular matrix: 

Lemma If tn {!), both A„ and A are nonsingular, then the matrix 


( 2 ) 


F = D - CA-^B' 


IS nonsingular. 

For the proof, postmultiply the first submatric column of A„ by A~^B' and 
subtract from the second, leaving 


A 

0 

C 

F 


M differs from A^ only by an elementary transformation; hence its rank is that 
of j 4„ But clearly the rank of M is the sum of the ranks of A and F. There¬ 
fore, the rank of F is n — m, and F is nonsingular. 

The inversion formula itself is the following identity: 


A B' 

—1 

A~' + B'F~" CA'^^ - A"" B'W^ 

C D 


-F~‘CA~‘ F-' 


A direct verification that the identity holds can be obtained by multiplying the 
right member in either direction by the right member of (1), yielding the unit 
matrix. 

In section 3, the formula exhibited for Al+i at step {7'} is easily identified 
as a special case of formula (3) where n = i -j- 1, m = i. F corresponds to /,, 
which is a scalar number; hence F-’- is easily computed in this case. 
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6. Second order enlargement. In formula (3), once A~^ is given, the rest of 
the work is essentially straightforward matric multiplication, except for com¬ 
puting F~\ In section 3, P was easily inverted since it was of order unity. F 
Can also be easily inverted if it is of order two, so that a second order enlargement 
procedure is feasible, computmg -47+2 from ^47^. The steps are similar to those 
in section 3 but involve larger matrices. 

Letting At have the same meaning as in section 3, define now Bi,C{, and Dj 
according to the partitioning 

II II 


Then and C, are of two rows and i columns, and D, is of order two. Compute 
47^ as in section 3. From then on, to compute 47+2 from 4,, the steps are*. 
[3"} Compute B'i = 47*.B,. 

{4"1 Compute ~ CiE\. 

{5"j Compute F~i^ by steps [1] and (2] of section 3. 

(6'') Compute G[ = F^^E'i , and compute ifv = ^7’'C'v47^. 

17") Compute If. = 47' + E'iH ,. 

Then 


4{+i 


Ki -Of 
-Hi F7' 


If n is even, successive enlargements will lead 47'. If n is odd, then 47ii is 
attained, from which 47' can be computed according to section 3. 

The number of multiplications and additions for this procedure is the same as 
for section 2 However, less writing is involved since only about half as many 
4. are inverted. A disadvantage is that it is more complicated at each stage 
than is the procedure of section 3. 


6. Geometric enlargement. Another routine is that which may be called 
geometric enlargement. Here, 47' is computed from 47'. The steps may be 
described as follows. Letting 4, have the same meaning as previously, redefine 
Bi , C,, and Di according to the partitioning 

. 11 

.^ 2 * — 

Ik. Dt 

Then J9., C,, and D, are all, like 4,, square matrices of order i. Compute 47' 
according to steps {1) and (2), and compute 47' according to steps {3") through 
{7"}. From then on, to compute 47.' from 47', the steps are formally the same 
as before, with a complication in step {5'"!: 
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(3'"} Compute E\ = A-^B[ . 

(4"'} Compute F, = Df — C^E'i . 

{5'") Compute by geometric enlargement in the same way as A~^. 
{6'"j Compute G\ = F^^E\, and compute Ht = FT''CiA7^. 

{7'"} Compute Ki = AT^ + E[Hi. 

Then, 

II -G. II 


This method involves less writing than the others, but is more complicated. 


7. Condensation methods; special cases. Formula (3) also affords a basis for 
condensation methods by “back solution.” For example, let A be of order m, 
where m is one or two so that A is easily inverted. Then F is of order n — m, 
and we will denote it by Fn-m • Partition into the form 


F = 

* n—m 



£ 1 ( 2 ) 


£( 2 ) 


where is again of order m, defining of order n — 2m. Continue the 
process until an F, is reached which is easily inverted, and solve backwards to reach 
F n~Hn) and then An, by repeated use of (3). 

Formula (3) is of great help in those special cases where A is large but easily 
inverted, such as a diagonal matrix, orthogonal matrix, etc. The labor can then 
be focussed on inverting an F which is much smaller than A„ . 


8. Further identities. It is of some interest to exhibit some matric identities 
relevant to formula (3), Using the notation of section 4, let us seek the inverse of 
An partitioned in the form 


(4) An^ 

An equation to be satisfied is 


W X' 
Y Z 


W X' 


A B' 


I 0 

Y Z 


C D 


0 I 


which yields the equations 


(5) 

WA +X'C = I 

(6) 

WB' + X'D = 0 

(7) 

YA+ZC = 0 

(8) 

YB' + ZZ) = I. 
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If A and D are nonsingular, then from (6) and (7), 

(9) X' = -WB'D-\ Y = ~ZCA-\ 

Using (9) in (5) and (8), and remembering the lemma of section 4, we obtain 

(10) W = (A - Z = (D - CA-^B')-'. 

Using (10) in (9) yields 

(11) X' = ~{A - B'D-^C)~^B'D-\ F = -[D - CA-^B')-^CA-\ 
Putting (10) and (11) into (4) completes the formula 

(A - B'D'^ C)“‘ - (ri - B'D-^ C)-^ B'TT^ 

-{B ~ CA-^B')-^CA~" {D~CA-^B'y^ 

Comparing (3) with (12), we have the identities 

(13) {A ~ B'D-^C)-' = A-' + A~'B'iD - CA-^B')~^CA-'^ 

(14) (A - B'D-^C)-^B'D~^ = A-'B'(D - CA-^B')-\ 

which may of course be verified by direct simplification 
An important feature of each of these identities is that the matrix in parentheses 
on the left is of order m, while that in parentheses on the right is of order n ~ m. 

A special case of (13) was noticed by the writer [3], [4] and of (14) by Leder- 
mann ([7], [8]) and the writer ([3], [4]), in connection with regression problems 
of factor analysis In this special case, A is a diagonal matrix and hence easily 
inverted, n — m is the number of common factors, which is usually small com¬ 
pared with m, the correlation matrix of m observed variates is given factored 
into the form A — B'D~^C ; and the work of inverting the correlation matrix of 
order m is simplified essentially into inverting a much smaller matrix. 

It should be noticed that (12), (13), and (14) assume that both A and D are 
nonsingular, where (3) assumes only that A is nonsingular (since then F must be 
nonsingular from the lemma of section 4) 
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THE FREQUENCY DISTRIBUTION OF DEVIATES FROM MEANS AND 
REGRESSION LINES IN SAMPLES FROM A MULTIVARIATE 
NORMAL POPULATION 

By D. J. Finney 
Oxford Universily, England 


1. Simunary, The joint frequency distribution has been found for any set 
of the (n — k) deviates from their sample mean of each of the t variates in a sam¬ 
ple from a multivariate normal population. Expressions for the variance of any 
single deviate in this distribution, the correlation coefficient between any pair 
of deviates, and certain partial correlation coefficients between any pair have also 
been obtained. 

These results have been generalized so as to include the corresponding proper¬ 
ties of deviates from a set of t multiple linear regression equations estimated 
from the sample, the m independent variates being the same for each of the t 
dependent. 


2. Introduction. Some years ago, Irwin published results relating to the fre¬ 
quency distribution of the deviations of individual observations from the mean 
of a sample drawn from a normal population (see [1]). He derived an expression 
for the joint distribution of any number of these deviates, which distribution 
is always of the normal multivariate form, and thence obtained the total and 
partial correlation coefficients between any pair of the deviates. 

The purpose of this paper is to discuss the generalization of Irwin’s problem, 
firstly to the properties of the deviates of individual observations from the mean 
in a sample from a multivariate normal population and secondly to the properties 
of deviates from a regression e quation instead of from a mean. So far as is known 
to the writer Irwin’s results are of little practical importance, and these generali¬ 
zations are probably of no practical value whatsoever. Nevertheless, they have 
some interest as additions to the knowledge of the mathematical properties of the 
normal frequency function, and for that reason alone they are put on record here. 


3. Deviations from the sample mean. Irwm based his discussion on a normal 
population with mean m and variance c*, but the algebra is simplified a little, 
without any real loss of generality in the final results, if, by means of a prelimi¬ 
nary transformation, these parameters of position and scale are made zero and 
unity respectively. The multivariate normal distribution in the t variates .-y, 
{i = 1, 2, ■ • • <), each with mean zero and variance unity, has the frequency 
function 


( 1 ) 




exp < - 


25: 


p'^2/ iV 


} 
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where i, j = 1, 2 ’ • ■ • t; p*’ is the cofactor of the element in the determinant 
of population correlation coefficients 


( 2 ) 


R = 


1 P12 PlJ ■ 
Pl2 1 P23 ■ 


Pl( 

Pit 


PU pit Pit ' ‘ ' 1 


A summation convention for the aflSxes i, j is understood throughout this paper, 
except when the contrary is explicitly stated. 

Let (iyp) represent a sample of n independent sets of values of the t variates 
randomly selected from the population, (p = 1, 2, ■ ■■ ,n). Then the element 
of probability for the sample is 

^ i P , * ■ 


If is the mean of the n sample values of {y, the deviates from the mean are 
{iYp), where 



the summation being taken over q = 1,2, •• • n with 


S 


Vi ~ 


1 if 
0 if 


p = g 
p ^q. 


Now the , Y are linear combinations of normally distributed variates, and are 
therefore themselves normally distributed. Clearly 


( 6 ) E{tYp) = 0 

and, from an expansion by means of equation (4) using 


(6) / i\ 

EiiYp.Y,) = (Sp,- 

wherep,, = 1 (not summed). Consequently the variance of any one deviate is 


(7) 


<r\iYp) = 


n — 1 


n 




and the correlation coefiicient between any pair is 

(8) p(.rp,,y,) ^ P.,- 

Equation (7) and equation (8) for the particular case oi i = j agree with the 
weU-known results that Irwin has already given as equations (10) of his paper. 
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For any i, only (n — 1) of the deviates ,Fp are functionally independent. The 
joint distribution of these for p = 1,2, (n — k) may be obtained from an 
inversion of the matrix of correlation coefficients. If A is the determinant of 
this matrix and A(<7p , ,7,) the cofactor corresponding to the two elements 
specified, this inversion shows that 


The joint distribution is therefore 

(10) const. X (sp, + 0 ,7p ,7,} IKdT). 

Now A may be evaluated as 


and the constant multiplier in equation (10) is therefore 


( 11 ) 



{(2x)‘ 


From equation (9), the partial correlation coefficient between any two of the 
variates in the distribution (10), the remaining t(n — k) — 2 being held constant, 
is written down as 


(12) partial correlation coefficient between ,7p , j7g 


fc5p, + 1 p*’ 

fc + 1 ■ (p*'p")*' 


the summation convention is suspended for this equation. 


4. Deviations from regression equations. The results obtained in section 
three may be generalized so as to relate to the frequency distribution of deviates 
from linear or polynomial regression equations instead of to deviates from means. 
Suppose that there are tn independent variates x", {a = 1, 2, • • ■ , m), which 
take values a;* corresponding to the sample observations ,-i/p ; polynomial re¬ 
gressions may be included by taking powers of an x as separate variates. If a 
conventional variate /, whose value is always unity, be introduced, the regres¬ 
sion equation of ,y on (a = 0, 1, 2 • • • , m), may be written 

( 13 ) iv = 

where a summation convention is understood for « = 0, 1, • • • , m and the 
regression coefficients are the solutions of the normal equations. 

.b“ = £ ^yv^P■ 

p p 


(14) 



Write 

(15) 
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P 

and let (Bai) be the inverse matrix of (B“^). 

Then the solutions of equations (14) are 

(16) .6“ = 

P 

If the deviation of ,yp from the regression equation (13) is {Zp , then 

tZip = ,j/p it}p 

~ (®p4 “ Ba^XpH^^ ^yq, 

4 

the summation for q being over g = 1, 2, • • • , n. As for equation (5), 
(18) Ei^Zp) = 0. 

Also 


(19) Ei^ZpiZq) = (Spq - 

since by definition 

= dpy. 


Write now 0 for the squaie matrix of (m + 1) rows and columns whose elements 
are the B“^, and Xp for the single column matrix of values x corresponding to 
the pth observation; i.e. 

(20) e = (5“") 


and 


( 21 ) 



Write also 


(22) ep.q... , = fl - XpX'p - XqX'q - XrX'r - • ■ ■ ■ 

Then 

Iffpl = [ fil'd -B„flx“p4), 

and 

I fip51 — I Opg + XpXq\ = — I fi I'BnUXpXj. 
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Hence, from equation (19), the variance of a deviate may be written 

(23) ^=(.Zp) = 

and the correlation coefficient between any pair of deviates is 

fpii (p = q) 

(24) + 

r 

For any i, only (n — m — 1) of the deviates |Z, are functionally independent. 
The joint distribution of these for p = 1, 2 , • • • , (n — fc) and any k > m + 1 
may be found by inversion of the matrix of correlation coefficients obtained from 
equation (24). The multiplier of the exponential in this distribution of i(,n — k) 
variables is 

lejUin-fc) 

(2t)*“"“*’ f)*' ’ 

where 


D = 


Mil 

|5lj| — \6a + XlXj] 


- \$i.p-k + x,xU| 

jpisl — + X 1 X 2 I 

Ifel 

i92,T,-sl 

— \92,n-k 4* XjXi_i|,| 

— |5l,n-it+XiXn-*l 

162 , 11-^1 ~ 1 92,n~k-\‘X2X n-k\ 

... 

1 1 


Since 6 is positive definite, there exists a non-singular matrix K such that 

K9K' = I. 


Then the Xp may be transformed to new column matrices Wp by 


and consequently 
It follows that 


KXp = IT, = 


W^p 

1 

Wp 

wi 


w, 


Xp = ir^Wp 


\ep\ = \e\-\i-WpW'p\, 

which may be reduced to the form 
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Similarly 


1 ^PS 1 

1 Opg + XpXg 1 = -1 

6 1 W%Wg . 

Hence 


1 — Wiwt 

a a 

—W 1 W 2 

-wtwl-k 

Z) = 1 fl 1""* 

— Wiwt 

1 — W 2 W 2 ■ • • 

— WiWl-k 


—Wiwl-k 

— WiWn-k ■ • • 

1 Wn^'^n—k 


This may be transformed into 



D = \ e I""* 




I Wi • • • TTfi-it 
= I e r~*-1 1 - WxW[ - WiWi - 

= I e I" * 1 |. 


W'.-k 

Im+l 

■ -Tf_*F'n_* I 


Thus, finally, the constant in the distribution is found to be 

1 / L^V‘ 


(26) 


in which Qh has been written for 5i,2, ,(n~k) > a matrix of the same form as $ 
but calculated from the last k sets of observations only. 

The cofactors of the matrix of correlation coefficients, required for the coeflfi- 
cients of the quadratic form in the distribution, can be derived in a similar man¬ 
ner. The distribution may be written 


1 _ /MV' 


of which the distribution (10) is easily seen to be the particular case for m = 0. 

From (26), the partial correlation coefficient between any pair of deviates, 
,Zp and jZg , may be written down as 


(27) 


I Qk + XpXg I -H {Spg — 1) I I p'^ 


m this expression the summation convention is again suspended. 
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ON THE ASYMPTOTIC DISTRIBUTIONS OF CERTAIN STATISTICS 
USED IN TESTING THE INDEPENDENCE BETWEEN SUCCESSIVE 
OBSERVATIONS FROM A NORMAL POPULATION 

By P. L. Hsu 
Columbia University 

1. The statistics to be considered here have the general expression 

T = Q = S a„(x, - £)(x, - x), <S = I] (x. - xf, 

■where (a:i, • ■ ■, xn) is a sample fiom a normal population whose mean and vari¬ 
ance can evidently be assumed to be 0 and 1 respectively/ The purpose of this 
note is to study the asymptotic distribution of T assuming that the Xi are inde¬ 
pendent. The whole work may be regarded as a straightforward application of 
Cramer’s theory of asymptotic expansion (see [ 1 ], pp. 69-88). 

If A = [a, j] and 7 is the row vector 1, • ■ • , 1, 1] the quadratic form Q 

has the matrix (I — 7 ' 7 )A (7 — 7 ^ 7 ). The latent roots of this matrix, which are 
also the latent roots of A(Z — 7 ' 7 )^ = A(I — 7'7), 'will be denoted by 0 , Xi, • • •, 

, with n - N — 1. Then Q and S can be simultaneously diagonalized (by a 
rotation of the A^'-dimensional space), so that 

S = t>yl, 

rwl tmI 

where the j/r are again independently and normally distributed ■with zero mean 
and unit variance. 

We shall make the following assumptions 

(a) I Xr I < 1 for all r. 

(b) There is a positive number c independent of n such that 

n 1 

2 (^r - > cn, where X = - 2 . 

r-l n r_l 

Write 

4/2£(X,-X)*» 

* ~ / == I Sfrt(a:) = (Xr — X — z) , 

— 2nx^ r-l 

A, = (X, - X — z)(yl — 1), G{x) = Pr{T < X -f- 2 }. 

iThe exact and the approximate distribution of such statistics were a recent subject of 
study by a number of statisticians See W. J. Dixon, “Further contributions to the prob¬ 
lem of serial correlation," Annals of Malh Slat., Vol 15 (1944), pp. 119-144. Further 
references are listed in Dixon’s paper 
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Then it can easily be verified that 

This expression of 0{x) shows that the application of Cramer’s expansion is at 
hand, since E{Xr) = 0 and 2s2(x) is the variance of SX,. Let ptn and Tin 
stand for the same quantities as defined in Cramer’s work (see [1], pp. 70-71). 
Since moments of all order of Xr exist, we may use 2fc + 2 in place of k. We have 


P2*!+2,n 


- Wifi2j:+2(a;) 


Tlk+i.n 


n 

A zhh+i ’ 
‘»P2A;*f-2,n 


where = E(y^ — and i/ is a normal variate with mean 0 and variance 1. 

By virtue of assumption (a) [ T | < 1 Therefore we may confine ourselves 
to the range of values for which | \ + z | <1. Then | X, — X — z | < 2. Also, 
by assumption (b), Sjfa:) > S(Xr — X)* > cn. Hence pjit+ 2 ,n, and in conse¬ 
quence VnTTtw.n , are less than some constant independent of n and x. The 
remainder of Cramer s expansion, if it is justifiable, will therefore be less than 
where M is independent of n and x. The justification consists in verifying 
that the following condition is satisfied: if /,(0 is the characteristic function of 
Xr and A is any positive number, then 


n 


l.U.b.ni/r(0 

r-1 


for 


I Tik+2,n 

V 2s2(x) 


is less than Mi7Vfc+2,n , where Mi is independent of n and x (see [1], p. 85). Since 
Tik+ 2 ,n < i\/n ^ and S 2 (x) > c-\/n, it is sufficient to show that, if a and A are 
any positive numbers and if 


n 


U = l.U.b. n \fr{t) i 
r—1 


for 


> a. 


then U < Min ^ , where M^ is independent of n and x. Now 
|/,(0 I = U + 4 {"(Xr - X - z)“)-* 

whence 

U = ia\\r - X - z)’)-^. 

r-il 

Let p be the number of K for which (X, — X — z)’ < \c Then cn < 82(1) < 
^c(n — fi) -H 4|i; hence cn < (8 — c)p and 

C7 < (1 -f 2a“c)“*'‘ < (1 + 2o*c)"""'‘"*"‘” 

This shows that the desired condition on U is satisfied, and that therefore 
Cram4r’s procedure can be adopted. 


•This follows from the fact that Pn+i.n > 1- Of- Cramdr, [1], p. 70. 
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Wherever Cramer’s asymptotic expansion is valid, the terms in the expansion 
are most conveniently obtained with the help of Cornish and Fisher’s symbolic 
expression (see [2]): 


where 


4'(a:) = 




ft 

I »-*•'* 


dy 


and 7 y is the jth semi-invariant of the random variable whose distribution is 
under asymptotic expansion. In the present case we have 


where 


21 


/8y(a:) 


n 





Hence we may express our result as follows; 

(1) G(x) = exp [ g {£) ] 

where ( Rk{x) | < Mn~*, and M is independent of n and x. The symbolic ex¬ 
ponential in (1) is to be expanded as far as and including the term in 


2 . Let us apply the result (1) to the following three statistics: = Qa/S, 

(o = 1, 2, 3), where 

It 

Qi = 12 (xi - £)ix,+i — z) with a:w+i = Xi, 

1-1 

AT-l 

Q» = ^{xi - xf -h \{xy - i)* + (xi - f)(a:,+i - £), 

1-1 

w-i 

■ Qs = 2 (a:; - - x). 

<-i 

Ti is simply related with T* = Q*/S, where 

Q* = Z {xi - xt+k?-, 

for we have Qi — 8 — whence ^2 ~ 1 — iT* We shall write for the 
X’s corresponding to Qa, and 

6.. = Z (Xi"’)", 

r«l 


(a= 1, 2,3). 
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(i) For Qi we have Xr“ = cos ^ (see [3]). Since 


cos 






we have 




2r(3/-m)</W 


If m < n, then 

n 

2 = -1 if J jm, = n if j = im. 

r*l 

JV / m\ 

Hence, for m < n, 6mi = — 1 if m is odd, bmi = ^(i ) — lifmis 

\2^/ 


even. 


In particular 




n r-l 


71 — n — 2 
2n 


> 0.4n if n > 7. 


Hence assumptions (a) and (b) are true (for n > 7). The Sj(x) are conveni¬ 
ently computed with the help of b^i. The ^,(x) are then computed to yield 
the terms in (1). 

(ii) The X’s corresponding to Q* are 4 sin* ^ (see [4]). Hence 


, ( 3 ) 

X” = cos ^ . 

N f m\ 

By a computation similar to that in (i) we easily obtain bmi = ^ I J “ ^ 
even m and bmi = 0 for odd m, provided m < 2n. In particular, x!*' = 0, 
S(Xj*' — X^*’)* = ^ ^ > -dw for n > 6. Hence assumptions (a) and (b) are 

true (for n > 5). 


(iii) In the case of Qj the matrix A is 


A = 


0 


0 


i 0 h 
h ■ 


0 I 


whose latent roots are cos 7r</(iV + 1), (< = 1, • • • , N) (see [5]), aU less than or 
equal to unity in absolute value. It follows that the same is true for the Xj*’. 
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Hence assumption (a) js true. Unlike the two previous cases, there is no sim¬ 
ple expression for 5„,3 With the help of the formula 

= tr {A{I - 7'7)r 

we may compute tma for small values of w. Thus 


bu — — 


n + 1 


^ n 2n — 1 , n 
““ ■ 2 (n+ 1)“ 


n 


^ _ 3(n - 1) , 3n(2n - 1) 

'' 71 + 1 2(n + 1)’ (n + ly 

, 3n - 2 8n - 11 , 4n(n - 1) (271 - 1)' _ 27x^(271 - 1) ^ n* 

^43 r» rt/.. I 1 \ I 


8 2(n + 1) ' (71 + 1)2 2(n + 1)“ (n + !)> (n +jl)‘ 

5(4n - 7) 57i(8n - 11) 5(271 - l)(7i - 1) 57i'(7i - 1) 

4(71 + 1) 8(71 + 1)2 2(71 + 1)2 (n + 1)2 


571(271 - 1)* 5n\2n - 1) 

-T- 


n 


4(71+1)2 2(71 + 1)2 (n + l)2 

n 271 - 1 , - 71 ^ 

71 + 1 (71+1)2 — 

Hence assumption (b) is true (for n > 10). Usmg these values of 6,„i we may 
compute and Ptix), By (1) we have 


7l T' i f—1 M 


G{x) = 4>(i) - — Ps{x)^'-’\x) + \ (Pi{x)^^*\x) + ^/5l(x)4)"’(a:)) 

71 ’ n 


- + (^.(x)$'“^(x) - (x); 34(*)$"'(x) + ^^;(x)4''”(x)) + R(x), 


where j R(x) 1 < Mn “ and M is independent of n and x. 
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NOTES 


This section is devoted to brief research and expository articles, notes on 
methodology and other short items. 


ESTIMATING THE PARAMETERS OF A RECTANGULAR 
DISTRIBUTION 

Bv A. George Carlton 
Columbia University 

1. Introduction. In this note, the range and midrange of the sample are 
shown to be a pair of sufficient statistics, and maximum likelihood estimates, 
for the true range and true mean of a rectangular distribution, exact and limiting 
distribution of midrange, range, and their ratio are derived; the “efficiencies” 
of the sample mean and median as estimates of the true mean are calculated; 
and the limiting distribution of the difference between tw'o sample midranges is 
derived. All the limiting distributions are non-normal, and the error of estimate 
is of order n~^ rather than the customary order n~^. The limiting distribution 
of midrange, and the limiting ratio of variances of the midrange and sample 
mean were given by Fisher [1]. 

f{x) and F{x) are used throughout to designate the probability density func¬ 
tion of a: and the distribution function (cumulative probability function) of x; 
the argument will also indicate the random variable being considered 

2. Exact distribution of midrange, range, and their ratio. Let ii, • ■ • , 

be a set of n independent observations on a random variable having the rectangu¬ 
lar distribution/(a:) = 1/L, {d — L/2 < .t: < 0 -f L/2), where ^ is the true mean, 
and L the true range. The minimum observation u and the maximum observa¬ 
tion V are a pair of sufficient statistics for ^ and L, as the conditional distribution 
of the remaining observations for given u and v is independent of 6 and L: 

f{xi , ,Xr,\u,v) = {v - 

The midrange 0 = -b and the range L = v — u are maximum likelihood 
estimates of 6 and L, respectively, as they are the parameter values which 
uniquely maximize f{xi, ■ ■ ■ , Xn) for the given set of observations. We shall 
assume that the random variable is normalized by change of origin and change 
of scale so that 0 = 0 and L = 1. The ]oint probability density function of u 
and V IS 

f(u. V) = 

dvd(~u) dvdi—u) 

— n(n — l)(i> — 

355 


(1) 


i-h<u<v< i). 
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Making the transformation i — ^(u ■i-v),L — v — uin (1), 

(2) M 1) = n{n ~ (0 < 2 I 0 I < 1 - L < 1 ). 

Integrating out L from 0 to (1 — 2 |9 |), 

/(e) = r (1 - 2 | 6 1)"-^ (|e|<i). 

I m - m I = ^ - K1 - 21 e D”, (I fl I < ^). 

Odd moments vanish by symmetry; even order moments are 

(4) ne“(l - 2 1 e I)"-' dl = 2"^^ j . 

In (2), integrating out 5 from |(L — 1) to i(l — L), 

/(L) = n{n - 1)L"""(1 - L), (0 < L < 1). 

ii’(S) = n(n - 1) [ -L)dL = n(n - l)B 2 ;(n - 1, 2), 

Jo 

<0 < X S 1). 

«ffi - .(„ - 1) {E--(1 - 1) JI - . 

Thus gi(Z) = (n — l)/(a + 1); hence the bias of L can be removed by multi¬ 
plying E by (n -H l)/(n - 1). 

The statistic t = e/L can be used to test the hypothesis that the mean of a 
rectangular distribution of unknown range is 0. To obtain the distribution of I 
when the hypothesis “is true, sett = 0/L and L = Z in (2): 

/«, Z) = n(n - DZ"-*, (Z < (1 -f 2 I i 1)-^). 

(6) /(O = (n - 1)(1 + 2| «j)-” 

I Fit) - F{0) I = ^ - Kl + 2 I i 1)*-”. 

Moments of i do not exist for order greater than (a — 2); for fc < n — 2, odd 
moments vanish by symmetry and 

miO = 2{n - 1) I <"‘(1 + 2<)~"d< = 2^* / (^ 2 k^) ■ 

3. Limiting di^butions. 6, L, and t have non-normal limiting distributions, 
although 6 and L are maximum likelihood estimates; this is explained by the 
discontinuity of f(x, fl) ^ i = 5 ±. We obtain the limiting distributions of 
g = nfl and r = n(l — L). Substituting g and r in (2), and proceeding to the 
limit for increasing n, 

lim /(g, r) = lim ~ n) “ (0 < 2 | g | < r < «). 
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(7) 


The necessary simple integrations yield the following limiting distributions: 

/(«) = 

I m -F(0)\ = i- ie-*'*'. 

thkiQ) = (2fc) 1/2* ; 1 = 0. 

/(r) = re~', (r > 0) 

F(r) = !-(! + r)e-', (r > 0) 

Au(r) = (fc + 1)1 

The limiting distribution of s = is the same as that of nff, as is seen by com¬ 
paring (3) and (6). 

4. Comparison of 6 with x and x as estimates of 6. The sample mean x and 
median x are unbiased estimates of 6. 


( 8 ) 


1 I* t 

fliix) = - 3? dx l/(12n). 

n 

= [yf(x)dx = (I - xrix + §)"di, 


for n = 27n. -f- 1, m an integer. Substituting z = 1 — 4S*, then simplifying the 
Beta fimction obtained on integration, 

, , ... (2m +1)1 f" nf, , 1 1 

1 


m I m 12^’"+® Jo 
(4), with A: = 1, gives miC^) = 


4(2m + 3) 4(n + 2) 

Comparison of this with (8) 


2(n + 1)(» + 2) ■ 

— ^7h 

and (9) shows that H 2 {e)/n 2 {x) = = 3n/(n + 2). 

As rt mcreases, /x2(0)/m 2(S) 6/n 0; and /ii^^x) —*■ 3 Thus the “efE- 

ciency” of the mean is zero, and the median is only one-third as “efficient” as the 
mean (The concept of efficiency is not strictly applicable as B does not have a 
normal limiting distribution.) 

6. Limiting distribution of difference between two midranges. Let Bi and 

Bo be the midranges of samples of rii and tio observations, respectively, from two 
normalized rectangular populations, and let S = qi — qi = niBi — rioOi. Apply¬ 
ing the formula for composition of random variables, one obtains from (7), 


( 10 ) 


m = - q)m<k= f “ dg 

w—00 v—QO 

= r e"*'''e“‘’dg + f'*'e"*'*'dg + f e*'‘ 
J-eo Jo J |c| 


' e-"” dg 


= ie-*''' + I z 1 e-*'*' + ie-*'*' = (| z | + J)e 
I F(z) - F(0) 1 = 1- e-*'-'. 

M2*(2) = (fc + l)(2/i!)l/2“. 


.-2UI 


.-2|«1 


-2lil 
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z = ~ — 3 _ can. be used to test the hypothesis of equality of 

2 vi - Ui 2(^2 — Ui) 

means of any two rectangular populations, and has in the limit the distribution 
(10), if the means of the populations are equal. 

6. The one-parameter rectangular distribution. If/(a:) = l/X, (0 < oi < X), 
then /(xi, • ■ , Xn I u) = Thus ii is a sufficient statistic and is evidently 

the maximum likelihood estimate of X Here F(v) = (n/X)" ,/(r) = tid”~^X'"”, 
and fikiv) = X*n/(n +■ k). The normalized error y = n{\ — v)/\ has the prob¬ 
ability density function J{y) = (1 — which tends to e"® as n increases. 

REFERENCE 

[1] R. A Fisher, “On the mathematical foundations of theoretical statistics,” Phil 
Trans Roy Soc London, Series A, Vol 222 (1021), pp 309-368 


ON THE POWER FUNCTION OF THE SIGN TEST FOR 
SLIPPAGE OF MEANS 

By John E. Walsh 

Princeton Univereity 

1. Summary, This note compares the power functions of the sign test for 
slippage with the power functions of the most powerful test for the case of nor¬ 
mal populations. The sign test is found to be approximately 95% efficient for 
small samples. 

2. Introduction. Let us consider a univariate population whose mean equals 
ite median and whose cumulative distribution function is contmuous a’t the 
mean. A sampling method of testmg the supposition that the mean of this 
population exceeds a given constant value juo (slippage to the right) is furnished 
by considering how many values of the sample are less than juo. An analogous 
method applies for testmg whether the mean is less than yo (slippage to the left). 
A particular class of populations for which the sign test is valid are the normal 
populations. This note compares the power functions of the sign test with the 
power functions of the moat powerful test for slippage for the case m which the 
population is normal (Table I). It is shown that the sign test is approximately 
95% as efficient as the most powerful test (the Student f-test) for samples of size 
4, 5 and 6, and that although the relative efficiency of the sign test decreases as 
the sample size increases, its efficiency is approximately 75% for samples of size 
13. This supports the idea that for normal populations little efficiency is lost 
by using attributes mstead of continuous variables if the sample size is small 

In choosmg between the sign and Student t-tests for slippage the following 
considerations may be of interest: 
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(a) The sign test is valid for a more general class of populations than the £-test. 

(b) The sign test is almost as efficient as the £-test for small samples from nor¬ 
mal populations. 

(c) The sign test is much more easily computed than the £-test. 

(d) The sign test has a very limited choice of significance levels for small 
samples ■while the £-test can have any desired significance level for any size 
sample. 

The considerations (a) to (d) also apply in choosing bet'ween the sign test and 
the Daly test based on (x — no)/R, where £ is the mean and R the range of the 
sample used for the test (see [1]). 

In section 6, Table II shows that for small size samples the significance levels 
of the sign test do not change greatly if the mean is only approximately equal 
to the median. 

3. Statement of sign test. Let xi, • • • , a:„ be a sample of size n from a uni¬ 
variate population whose mean equals its median and whose cumulative distribu¬ 
tion function is continuous at the mean, that is, which has the property that 

(1) Pr(x < m) = Pr(x > mJ = i 
where g is the population mean. 

The significance test to decide whether n exceeds a given constant value juo 
is defined by 

(2) If m or less of the sample values Xi, •••, Xn are less than ims , accept ii > tMi. 
The significance test to decide whether fi < uo is given by 

(3) If m or less of Xi, • ■ ■ , Xr. are greater than mo , accept ;u < mo • 

It is to be observed that in both (2) and (3) the null hypothesis tested is that 
n = Ho ■ In (2) the alternative is m > Mo and m ( 3 ) the alternative is m < Mo ■ 
From (1) it follows Immediately that (2) and (3) both have the same signif¬ 
icance level a{m, n), where 


a{m, n) 


Appropriate choices of m and n will result in values of a. (m, ti) suitable for sig¬ 
nificance tests For example 

a(0, 4) = .0624, 

a(l, 8) = 0352 

a(0, 5) = .0312, 

a{l, 9) = .0195 

aCO, 6) = .0156, 

a(l, 10) = .0107 

a(l, 7) = .0625, 

a(2, 13) = .0112. 


If the population has a continuous distribution function, Pr{xt = x, -,1 ^ j) 
= 0 In this case let X(,) be the zth largest of xi, • • , Xn . Then (2) can be 
restated as 

(4) 


If a:(m+i) > Mo) accept m > Mo. 
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Test (3) is seen to be equivalent to 
(6) If Ecn-m) < Mo , accept m < Mo . 

Thus for the case of populations with continuous distribution functions it is 
only necessary to determine one order statistic and compare it with no in order 
to apply a test. 

It IS to be observed that a particular class of populations which satisfy (1) are 
those which have distribution functions which are symmetrical and contmuous. 
Thus the normal populations represent a par-ticular class for which (4) and (5) 
are valid. 


4. Comparison with Student t-test. Consider the case m which the popula¬ 
tion is normal with mean p and variance <r*. Then the power function for (4) 
IS given by 


Power Function = Pr(x(m -\« > mo) 


= pr(i 




) 


j»I(n 


-"i-Di /. (£/*> ■'») (/. *) 


where 


1 -Ji/» 


Kv) = —7^ c 
V 2 % 


and 6 = 


Mo — M 


For a normal population, however, it is well known that the most powerful 
Studentized test of the one-sided alternative p > no is the appropriate Student 
i-test. Values of the power function for the i-test are found for given values of 
6 by using the normal approximation given in [2], 

The method of measuring the relative efficiencies of the two types of tests ivill 
be different from the common method of measurmg the relative efficiencies of 
estimates, which consists in taking the ratio of the variances of the two esti¬ 
mates as the measure of their relative efficiency. The principle followed here 
will be to consider a sign test based on a given sample size and vary the degrees 
of freedom of the /-test having the same significance level until the power func¬ 
tions of the sign test and /-test agree in the sense that in the half-plane 3^0 
the area between the two power curves for which the sign test power function 
exceeds the /-test power function is equal to the analogous area for which the 
sign test power function is less than the /-test power function. The considera¬ 
tions are limited to the half-plane 5^0 because the test is one-sided. The size 
of the /-test sample having this property divided by the size of the sign test sam¬ 
ple is called the relative efficiency of that sign test. Intuitively this relative 
efficiency measures how much more data must be added if the sign test is to 
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furnish an amount of information equivalent to that supplied by the i-test. In 
obtainmg the relative efficiencies in the manner described above, the degrees of 
freedom of the ^test are allowed to assume fractional values and the values of 
the power function are computed using the normal approximation as if it were 
valid for fractional degrees of freedom. The number of degrees of freedom, of 
course, can only be integral This method, however, gives an interpolated 


TABLE I 

A comparison of the power functions of the sign and t tests 


Test 

m 

n 

Approx¬ 

imate 

Relative 

Efficiency 

Significance 

Level 

Values of Power Function 

5"*—i 

5--1 

«=-ll 


t 


3.8 


.0624 

.219 

.484 

.755 


sign 

0 

4 

95% 

.0624 

.229 

.500 

.765 


t 


4.8 



.150 

.402 


.909 

sign 

0 

5 

96% 


.159 

.420 

H 

.888 

t 


6.7 


.0156 

.098 

.330 

.660 

.899 

sign 

0 

6 

95% 

.0166 

.110 

.355 

.655 

.863 

t 


5.6 


.0625 


.696 

.932 

.995 

sign 

1 

7 

80% 

.0625 

.311 

.711 

.920 

.988 

t 


6.4 


.0352 

.225 

.619 

.908 

.989 

sign 

1 

8 

80% 

.0352 

.239 

.630 

.869 

.978 

mm 


7.4 



.171 

.665 

.893 

.988 

WEM 

1 

9 

82% 


182 

.573 

.879 

.974 

t 


8 


.0107 

.117 

.468 

.848 

.983 

sign 

1 

m 

80% 

.0107 

.137 

.515 

853 

.964 

■i 


9.75 


mmm 

.162 

■1 

.950 

.998 


2 

13 

75% 


.165 

■1 

;949 

.998 


measure of the size sample of the i-test having the properties outlined above. 
Table I supplies a comparison of the relative efficiencies and the powers of the 
sign test and the <-test obtamed in the manner just described. Thus for samples 
of size 4, 5 and 6 the sign test is approximately 95% as efficient as the Student 
f-test. The relative efficiency decreases as the size of the sample increases but 
even for samples as large as 13 is approximately 75%. 
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For normal populations it is also well known that the most powerful Student- 
ized test of the alternative /; < juo is given by the appropriate Student 4-test. 
It is clear that Table I can also be considered as a comparison of test (5) with 
the corresponding Student 4-test if S is replaced by -5 and mhyn — m. 

6. Approximate cases. Suppose that (1) is only approximately satisfied by 
the population in question. 

Let Pr{x < ;u) = ^ + r. Then the significance level of (2) is 

m I 

Significance levels of (2) for small size samples are given in Table II as a func¬ 
tion of r. 


TABLE II 

A comparison of the significance levels of the sign test when the mean diners from 

the median 


m 

1 ^ 

Significance Level 

r=>0 

r-=-.02 


r** 02 

r=.06 

0 


HI 

.073 




0 



.038 


■■ 


0 



.020 


.012 



Table II shows that for small samples the significance level of (2) does not change 
greatly from a(m, n) if (1) is only approximately satisfied. Expression (6) 
shows, however, that for large size samples even a small value of r can cause a 
large change in the significance level of (2). 

For Fr[x < n) = | r it is apparent that the significance level of (3) is (6) 
with r replaced by -r so that Table II applies to tests (3) if this replacement is 
made. 
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AN APPROXIMATION TO THE PROBABILITY INTEGRAL 


By J. D. Williams 

United States Naval Ordnance Test Station, Inyokem, California 
1. Summary. It is shown that 

and that the equality is never in error by as much as three-fourths of one percent. 
Other approximations are discussed. 


2. For use on those occasions when an approximate analytic expression for 
the integral 


( 1 ) 


p(x) = 


1 

'\/2x 



dt 


is desired, the approximation 

(2) p' (x) = [1 - 


is simple and reasonably accurate. An approximation equivalent to this is 
quite commonly used in problems involving a bivariate normal distribution, 
but its use in the one-dimensional case seems to be less well known. 

We shall first show that p(x) < p'(x) and then estimate, by calculation, 
the relative error made when the equality is accepted. 


(3) 


|_2ir Jo Jo 


e *** dt 


i2r 


re 


•ir* 


dr dd 


= [1 _ = p>(x), q.e.d. 


The approximation, introduced at the stage of passage to polar coordinates, 
comprises replacement of the square region of integration — x < x, < x by a 

2 

circular region, 0 < r < x, having the same area. Since we are dealing 

with a circular normal distribution with zero means, the region of fixed area 
which covers the greatest density is a circle whose center is at the origm. 
Therefore our square region of area 4x’ must contain less density than the cir¬ 
cular region of area 4x'* by which we have replaced it. 

The maximum value of the relative error, 


*p 


= 

p(x) 


- 1 , 
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is found by calculation to be about Beven-tenths of one percent, as may be judged 
from Table 1, column 3. 

The question may be asked: Can the relative error be reduced by suitable 
choice of the parameter c in 

(5) p'Cx) = [1 - 

Calculation indicates that by taking c = 0.6302 the relative error is reduced to 
about one-half of one percent; but this gain is offset, for many purposes, by the 
loss of the inequality (3), 

The density function implied by (2), namely 

( 6 ) p'(x) = 1 ^' 

IT 

has the variance 

(7) = T (1 - log 2) = 0.964. 

If c is determined so that the density function will have unit variance, then 
(5) becomes 

(s, 

this approximation to (1) leads to relative errors of almost two percent, which 
occur when x is small. 

The density function (6) may be used to judge the quality of (2) in approxi¬ 
mating to an integral of the form 

(9) p(xi , * 2 ) = f e"*'* di, 

V 22r ■’*1 

the approximation being 

(10) p' (ii, * 2 ) = i b' (a:?) — P' (a:i)] 

when xi and X2 are positive (which is the severe case). It is evident that the 
relative error in accepting (10) for (9) cannot exceed the greatest relative dis¬ 
crepancy tp, in the interval Xi < a; < X 2 , betiveen density function (6) and the 
normal density 

(11) 

The quantity 


( 12 ) 


_ p'(.x) 

" P(x) 


- 1 


is tabulated in Table 1, column 6, from which it appears that the relative error 
committed in using (10) for (9) will surely be less than one-and-a-half percent 
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provided 0 < a:, < 1.8; but the relative error may be very great when the inter¬ 
val of integration lies beyond x = 1.8. 

The approximations described herein were suggested by the following situa¬ 
tion, encountered in work done by the Applied Mathematics Panel, NDRC: 
The probability P of at least one success, defined by — i < a:, < x, in a sample 


TABLE 1 


X 

p'(x) 

p(x) 


P'(*) 

p(*) 

*p 

.0 

0 

0 


.3989 

.3989 

0 

.1 

.0797 

.0797 


.3969 

.3970 

.0005 

.2 

.1586 

.1585 


.3914 

.3910 

.0010 

.3 

.2360 

.2358 

.0008 

.3821 

.3814 

.0018 

.4 

.3112 

.3108 

.0013 

.3695 

.3683 

.0033 

.5 

.3836 

.3829 

.0018 

.3539 

.3521 


.6 

.4526 

.4515 

.0024 

.3356 

.3332 


.7 

.5177 

.5161 

.0031 

.3151 

.3123 


.8 

.5785 

.5763 

.0038 

.2929 

.2897 


.9 

.6347 

.6319 

.0044 

.2695 

.2661 



.6862 

.6827 


.2454 


.0141 

1.1 

.7329 

.7287 


.2211 

.2179 

.0147 

1.2 

.7747 

.7699 


.1971 

.1942 

.0149 

1.3 

.8118 

.8064 


.1738 

.1714 

.0140 

1,4 

.8443 

.8385 

.0069 

.1516 

.1497 

.0127 

1.5 

.8725 

.8664 

.0070 


.1295 

1 9 

1.6 

.8967 

.8904 

.0070 

.1113 

.1109 


1.7 

.9171 

.9109 

.0068 



^9nS ! 9 

1.8 

.9341 

.9281 

.0065 




1.9 

9485 

9426 

0063 


.0656 


2.0 

.9600 

.9545 

.0058 


.0540 

BEI3 


of n pairs (xi, xj) from a population in which the independent component prob¬ 
abilities are p(x), is 

(13) P = 1 - [1 - p*(x)]". 

A little numerical exploration, supplemented by examination of the limiting 
values as X —»■ 0 and x —+ oo, revealed that when P is fixed the quantity log n is 
very nearly a linear function, of slope minus two, of log x; so nearly, in fact, 
that one was encouraged to posit the linearity and observe the consequences 
This yielded (5), which became (2) by requiring that it go to zero with x in the 
same manner as (1). 
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DISTRIBUTION OF THE RATIO OF SAMPLE RANGE TO SAMPLE 
STANDARD DEVIATION FOR NORMAL AND COMBINATIONS 
OF NORMAL DISTRIBUTIONS 


By G. A. Baker 

College of Agriculture, Universily of California at Davis 

1, Introduction. Tho distribution of sample ranges in terms of the stand¬ 
ard deviation of the sampled population for homogeneous populations has been 
dealt with in some detail by mathematical methods for the normal parent and by 
empirical sampling methods for non-normal parents. These results are pre¬ 
sented in summaiy in Tables XXII, XXIII, and XXIV of [1]. Bliss [2] suggests 
that the range in different sized samples from a normal parent at various levels 
of significance, in terms of the standard deviation computed with varying degrees 
of freedom, would be a valuable table. It is not clear whether he means that 
the standard deviation is to be estimated from the same sample as the range or 
from a second Independent sample, as is done by Newman [3], Pearson and 
Hartley [4], and Hartley [5]. 

In natural hybridization of distmct types of plants and subsequent back cross¬ 
ing with parental types distinctly bimodal populations may develop. Heiser 
[6] has described such a situation for sunflowers. Similar situations may occur 
in natural and artificial crossing of peaches and apricots as shown by the work of 
Hesse [7] of this station. In studying such genetical material it often would be 
helpful to know the expected distributions of»the sample ranges in terms of the 
sample standard deviations estimated from the same sample for certain typical 
nonhomogenouB populations. Applications to such data wili be published 
elsewhere. 

Since the mathematical situation for the distributions of the sample range 
{R) in terms of the sample standard deviation (s) appears somewhat complex, 
empirical sampling methods were resorted to for obtaining the distributions for a 
normal parent {N), a symmetrical distinctly bimodal nonhomogeneous parent 
(A), and a weakly bimodal but strongly skewed parent (B). Populations A and 
B are pictured in charts A (p. 341) and B (p. 348) of [8]. 

Population N is approximately represented by 

1296 , (X - 15.6)’ 


population A by 


(^) 


648 


5V2 


8 / ,{X - 15.5)’ , , (X - 32.5)’ 

^ ^exp. - ^ -I- exp. ~ p 


25 


25 


). 


and population B by 
972 


(B) 


5\/2 


2 / ,iX- 15.5)’ , , 1 (X - 31.5)’\ 


25 
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The method of drawing samples is the same as that originally described in [9]. 
N, A, and B each have a total area of 1296. Thus, 1296 integers distributed 
over a proper range and with the frequencies indicated by the corresponding 
areas under the curves N, A, and B were entered on charts with 6 big rows and 
6 big columns of squares which were subdivided into 6 little rows and 6 little 
columns. In each case the 1296 integers were distributed in a non-systematic 
way among the 1296 little squares. By throwing 4 differentiated dice (one die 
assigned to a big row, one to a big column, one to a little row, and one to a little 
column) it was possible to draw random individuals from populations that are 
approximately N, A, and B. 

Fisher [10] has defined gi which measures the skewness of a distribution and gi 
which measures the flatness. These g’s are equivalent to the square root of ft 
and ft — 3, respectively in Karl Pearson’s older notation. For population A, 
Pi = 0 and p 2 = — 1.10. For population B, pi = 0.62 and = —0.29. 

TABLE 1 


Distribution of range in terms of sample standard deviation for samples of specified 
sizes from a normal parent population (N), Pi = 0, Ps = 0 


Sample 

Size 

Number 

of 

Samples 

Mean 

Standard 

Devia¬ 

tion 

Si 

Standard 
Error of 
(Normal) 

Cs 

Standard 
Error of gt 
(Normal) 

2 


1.4142 

0.0 



0 0 


4 

mgm 

2.2238 

0.1564 



0.434 

0.1400 

16 


3.6112 

0.3879 

0.115 


0.135 

0.2783 

36 

135 

4.4014 

0.6076 



0.332 

0.4142 

64 

76 

4.8272 

0.6409 

0.492 

0.2756 

-0.751 

0.5448 

100 

48 

5.1216 

0.6616 



1.038 

0.6744 


2. Empirical random sampling results. The sample sizes considered are 2, 4, 
16, 36, 64, 100. The distribution functions for various sample sizes are char¬ 
acterized by givmg means, standard deviations, pi’s, and g^’s. The results are 
given in Tables 1, 2, and 3. The standard deviations of the samples were com¬ 
puted by dividmg the sum of squares by one less than the number in the sample. 
When the size of the sample is two then the range divided by the standard devia¬ 
tion of the sample is always a constant, square root of 2. 

The constants for the distributions for all sample sizes except four were com¬ 
puted without grouping. The constants for the distributions for samples of 
four were computed from grouped data with a small class interval. 

3. Discussion. The mean values of the range divided by the standard devia¬ 
tion of the sample for population A run lower than for populations N and B. 
The standard deviations of the distributions for all parents increase from zero 
and continue to increase throughout the range considered for population N. 
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The standard deviations cut down much more quickly for population A than 
for population B. The values of gi and pj show that the distributions are sig¬ 
nificantly non-normal for certain sample sizes but perhaps not seriously so for 
other sample sizes. 

The distributions of range divided by the sample standard deviation are quite 
different from the corresponding distributions of range in terms of the standard 
deviations of the population as can be seen by reference to the tables in [1]. 

TABLE 2 


Distribution of range in terms of sample standard deviation for samples of specified 
sizes from a bimodal symmetrical population (A), = 0, g^ = —1.10 


Sample 

Site 

Number 

of 

Samples 

Mean. 

Standard 

Devia¬ 

tion 


Standard 
Error of o, 
(Normal) 

ffi 

Standard 
Error of ot 
(Normal) 

2 


1.4142 

0.0 

0.0 




4 

1040 

2.2060 

0.1551 

-0.468 

0.0768 


0.1516 

16 

269 

3.6742 

0.5283 

1.026 

0.1514 

1.182 

0.3015 

36 

115 

4.0690 

0.4604 

0.561 

0.2256 


0.4474 

64 

64 

4.3194 

0.3377 

0.106 

0.2993 

-1.829 

0.5906 

100 

41 

4.4846 

0.3194 




0.7246 


TABLE 3 

Distribution of range in terms of sample standard deviation for samples of specified 
sizes from a skewed bimodal population (B), gi = 0.6S, gz = —0.29 


Sample 

Size 

Number 

of 

Samples 

Mean 

Standard 

Devia¬ 

tion 


Standard 
Error of m 
(Normal) 

i 

ffi 

Standard 
Error of a 
(N ormal) 

2 


1.4142 


0.0 

1 

1 

0.0 


4 

1061 

2.2268 

0.1459 

-0.470 

0.0751 

-0.142 

0.1600 

16 

265 

3.9277 

0.6938 

0.640 

0.1496 

0.405 

0.2982 

36 

117 

4.4792 

0.5476 

0.400 

0.2236 

0.018 

0.4437 

64 

66 

4.8485 

0.5249 

0.634 

0.2960 

1.028 

0.5906 

100 

42 

5.0481 

0.3626 

-0.092 

0.3655 

-0.632 

0.7166 


At the suggestion of the referee it is noted that the empirical results for the 
means in Table 1 are rather well approximated by B{R)/E[s). It is necessary 
to remember that E(s) a for small samples. For a discussion of E{,s) see 
Kenney [11] equation 28, page 136. 

It is also noted that if 

X = log (log sample size — log 2) 
y = log ^mean — ■^/2^ 
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then the plots of the (X, Y) values in each case are approximately straight lines 
for the present range in sample sizes. 

The standard deviation and range when determined from the same sample 
are correlated. For the normal population this correlation decreases and prac¬ 
tically disappears for samples of 100 or greater. This is not true for populations 
A and B. For these populations the correlation between sample range and 
sample standard deviation decreases much more slowly and seems to be of the 
order of 0.5 for samples of 100. 
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NEWS AND NOTICES 

Readers are invited to submit to the Secretary of the Institute new items of interest 

Personal Items 

Dr. Theodore W. Anderson., Jr. of the Cowles Commission for Economic Re¬ 
search has been awarded a Guggenheim Memorial Foundation Fellowship. 

Assistant Professor Theodore A. Bancroft of Iowa State College has been 
appointed to an associate professorship at the University of Georgia 

Dr. Z. W. Birnbaum is now an associate professor in the mathematics depart¬ 
ment at the University of Washington. 

Mr. Albert H. Bowker and Mr. Edward Paulson, formerly with the Statistical 
Research Group, of Columbia University, have been awarded pre-doctoral 
fellowships in mathematical statistics by the National Research Council. They 
are now studying at Columbia University. 

Mr Oscar K. Buros, of Rutgers University, is Review Editor of the Journal of 
the American Statistical Association. He is making the review section a very 
important part of the Journal with such features as replicating reviews and biblio¬ 
graphies of statistical methodology. Members of the Institute who are authors of 
papers and books (both English and non-English) on statistical methodology are 
urged to send a reprint, review copy, or bibliographic information to Mr. Buros 
as soon after publication as possible. 

Professor Harold Cramer, Director of the Institute of Mathematical Statistics 
at the University of Stockholm will be a visiting professor at Princeton Uni¬ 
versity during the fall semester of the 1946-1947 academic year. He will give 
a course of graduate lectures on the theory of probability. 

Dr. J, H. Curtiss has been appointed assistant to the Director of the National 
Bureau of Standards, where his duties will include the administration of the math¬ 
ematical and statistical activities of the Bureau. Dr. Curtiss served in the U. S 
Naval Reserve during the war, and recently received a Commendation Ribbon 
from the Secretary of the Navy for his work in statistical engineering for 
the Bureau of Ships and the Office of the Comraander-in-Ghief. He will con¬ 
tinue to be on leave of absence from Cornell University throughout the academic 
year 1946-1947 Administrative direction of the Mathematical Tables Project 
of theNational Bureau of Standards has been assigned to Dr. Curtiss. Members 
of the Institute are cordially invited to visit the Project when in New York City, 
and to confer with the Project Director, Dr. Arnold Lowan, concerning their 
computational problems. The address of the Project is 150 Nassau Street, New 
York City. The Project is currently supported by funds transferred to the Bu¬ 
reau from the Office of Research and Inventions of the Navy Department An 
Advisory Panel of mathematicians interested in the computation of tables is 
being formed to define the long range program of the Project. An announce- 

370 



NEWS AND NOTICES 


371 


ment as to the personnl of this panel will appear in a later issue of the Annals. 

Assistant Professor W. J. Dixon of the University of Oklahoma has been 
appointed to an associate professorship at the University of Oregon 

Dr Hallett H. Germond has returned from war service to his teaching duties 
in the Department of Mathematics at the University of Florida 

Dr. Earl L. Green, has accepted a position as Associate Professor of Zoology 
at Ohio State University. 

Mr. John C. Hintermaier, formerly supervisory chemist with the Forstmann 
Woolen Company of Passaic has accepted a position as chief chemist of the Van¬ 
ity Fair Mills at Reading. 

Mr Wilham Hodgkinson, Jr , has returned from war service to his position 
with the American Telephone and Telegraph Company at New York. 

Mr, ;R,obert H. Hoskins, discharged from the Navy in March, is employed in 
the Actuarial Ordinary General Division of the John Hancock Mutual Life 
Insurance Company at Boston. 

A testimonial dinner was given to Professor Harold Hotelling on May 3, 1946 
at the Columbia University Men’s Faculty Club as a farewell by the Statistical 
Techniques Group, New York Chapter, American Statistical Association. 
Professor Hotelling is leaving Columbia at the end of the academic year to be¬ 
come Professor of Mathematical Statistics at the University of North Carolina. 
Professor Helen M. Walker, on behalf of the Group, presented gifts to Professor 
and Mrs. Hotelling. The Chairman, Professor Irving Lorge, introduced the 
distinguished visitoi s who came to honor Professor Hotelling Among the speak¬ 
ers were Professor P. C. Mahalanobis of Presidency College, Calcutta, India, 
Dr. Stuart Rice, Chairman of the Statistical Commission of the Economic and 
Social Council of the United Nations, and Dean Pegram of the Graduate Facul¬ 
ties of Columbia University. Professor Hotelling reviewed the changes in sta¬ 
tistical theory and techniques that were developed during the 15 years of his 
professorship at Columbia University. 

Mr. Calvin J. Eirchen, who has recently accepted a position with the technical 
department of Remington Arms Company at Bridgeport, Conn., addressed the 
Rochester Society of Quality Control Engineers on Sept. 17 on "The Applica¬ 
tions of Sequential Analysis to Acceptance Inspection”. 

Dr. Walter Leighton of the Rice Institute has been appointed to a professor¬ 
ship at Washington University. 

Miss Dorothy Marrow has been appointed to an assistant professorship at 
George Washington University 

Professor D E. Morton of the National Bureau of Econlmic Research is 
joining the faculty of Cornell University 

Assistant Professor Cecil J. Nesbitt of the University of Michigan has been 
promoted to an associate professorship. 

Dr. A. C. Olshen has accepted a position as Actuary of the West Coast Life 
Insurance Company at San Francisco. 
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Mr. William B. Rice has opened an office as Consulting Business Statistician 
at 1011 South Los Angeles Street, Los Angeles. 

Mr. John Salerno, formerly a draftsman (statistical) with the War Department 
is now Mathematician with the U. S. Coast and Geodetic Survey, 

Assistant Professor Henry Scheff4 of the Mathematics department of Syracuse 
University has been appointed associate professor of engineering at the University 
of California at Los Angeles, Professor Scheff4 has been awarded a Guggenheim 
Memorial Foundation Fellowship. 

Mr. William B. Simpson has returned from overseas and is attending the Uni¬ 
versity of Chicago 

Professor Geoge W. Tyler has returned to his position in the Mathematics 
Department at Virginia Polytechnic Institute, having spent two years at the 
University of California Division of War Research. 

Professor W. Allen Wallis, who returned to his position at Stanford University 
in April after serving for nearly four years as Director of Research with the 
Statistical Research Group of Columbia University, has accepted a position as 
Professor of Statistics and Economics in the School of Business of the University 
of Chicago effective September 1, 1946. 

Mr. Frank A. Week who served during the war as a Captain in the Office of the 
Surgeon General is now m the Actuarial division of the MetropoHtan Life In¬ 
surance Company. 

The University of Pennsylvania held a conference on ‘‘Measurement of Con¬ 
sumers Interest” at Philadelphia on May 17-18, 1946. This conference was 
sponsored by the Departments of Philosophy, Psychology, Statistics, Marketmg, 
and Foreign Commerce. Among the speakers were the following members of 
the Institute; Professor L. L. Thrustone of the University of Chicago, Professor 
Louis Guttraan of Cornell University, Dr. W. Edwards Deming of the Bureau 
of the Budget, Professor C. West Churchman of the University of Pennsylvania, 
Dr. John H. Curtiss of the National Bureau of Standards, Professor Paul Peach 
of the University of North Carolina, and Professor S. S. Wilks of Princeton Uni¬ 
versity. 

The followmg four doctorates, with mathematical statistics as a major subject, 
were conferred during 1945 in the United States. The name, University, month 
in which the degree was conferred, and the title of the dessertation are given in 
each case; 

T. W. Anderson, Jr., Princeton, June, ‘‘The Non-Central Wishart distribution 
and its application to Problems in Multivariate Statistics.” 

Frances Campbell, Michigan, June, “A Study of Truncated Bivariate Normal 
Distributions.” 

W, M. Chen, California, June, ‘‘Power Function of the Analysis of Variance and 
Convariance of a Normal Bivariate Population.” 

J. J. Livers, Michigan, February, “Use of Partitions in Multivariate Moment 
Sampling Theory.” 



NEWS AND NOTICES 


373 


Professor A. R Crathorne of the University of Illinois, a Fellow of the In¬ 
stitute and one of its founders, died on March 7, 1946 at the age of 72 


Announcement of New preliminary Actuarial Examinations 

On June 7, 1947, three new Preliminary Actuarial Examinations wih be given 
to undergraduate students of mathematics and others who may be interested in 
going into the actuarial profession. These new examinations are sponsored 
jointly by the Actuarial Society of America and the American Institute of Ac¬ 
tuaries. 

The new series of exammations will replace Parts 1, 2, and 3 of the actuarial 
examinations which have been given heretofore, but will carry the same credit 
toward Associateship in the two actuarial organizations. These examinations 
have been prepared under the direction of a joint committee of actuaries and 
mathematicians. They will be administered by the College Entrance Examina¬ 
tion Board at centers throughout the United States and Canada. 

Descriptions of the three new examinations are as follows: 

1. Language Aptitude Examination. This is a three-hour aptitude examina¬ 
tion testing reading comprehension and precise knowledge of the meaning of 
words. It is similar to the well-known Scholastic Aptitude Test of the College 
Entrance Examination Board, except that it is pitched at approximately the 
college sophomore level. Verbal facility and command of the English language, 
as well as mathematical ability, are important in the actuarial profession. This 
is not the type of an examination for which sjiecific preparation can be made; 
it is an aptitude rather than an achievement examination. 

2. General Mathematics Examination. This is a three-hour achievement 
examination on material usually covered in the first two years of mathematics 
in colleges and universities in the United States and Canada. More speci¬ 
fically, it is based on college algebra, trigonometry, analytical geometry, and 
differential and integral calculus. It is designed to be taken by the mathe¬ 
matically talented undergraduate at the end of his sophomore year, although 
it is not restricted to this group. 

3. Special Mathematics Examination. This is a three-hour achievement 
examination based on the material usually covered in undergraduate courses 
in finite differences, probability, and statistics. It is designed to be given at 
the end of the junior or senior year to college mathematics majors who have 
either taken courses or done concentrated reading in these fields, but it is not 
restricted to this group. 

The two actuarial bodies will jointly award one $200 and eight $100 prizes 
to the nine highest-ranking contestants on the basis of performance on the first 
two of the examinations described above. In determining these awards the 
General Mathematics Examination will be weighted twice as much as the 
Language Aptitude Examination. 
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Information regarding these new examinations, and applications for taking 
them, may be obtained from either of the following organizations: 

The Actuarial Society of America 
393 Seventh Avenue 
New York 1, New York 

The American Institute of Actuaries 
720 North Michigan Avenue 
Chicago, Illinois 


Announcement of Cowles Fellowships for Women 

TwoSarahFrancesHutchinsonCowlesFellowships forwomenwill be awarded 
by the University of Chicago for the academic year 1947-48 upon nomination by 
the Cowles Commission for Research in Economics. Applicants must be stu¬ 
dents of outstanding promise, preparing for the degree of master or doctor in the 
field of social sciences and statistics, preferably in quantitative economics or 
mathematical statistics. The Fellowships amount to f1000 each, but may be 
supplemented by an additional grant of $500 if the work of the Fellowship holder 
lies within the Cowles Commission’s field of interest. Holders will be expected 
to be in residence at the University of Chicago. Application and supporting 
documents must be filed before March 1,1947. Application blanks and further 
particulars may be secured from the Cowles Commission for Research in Eco¬ 
nomics, The University of Chicago, Chicago 37, Illinois, U. S. A. 


New Members 

Tfte following persons have been elected to membership in the Institute: 

Alger, Philip L., M.S. (Union) Staff Ass’t to Mgr of "Eng , Gen. Elec Co Schenectady, 
N y., 1758 Wendell Ave , Schenectady 8, N. Y. 

Baer, Prof. Relnhold Ph.D (Gottingen) Dept of Math. U. of 111., Urbana, Ill 

Behrends, Stanley George, Li B, (La Salle) Ass’t. Purchasing Agent, 4S9-6Bth St, Oak¬ 
land, 9, Calif. 

Benford, Frank, JB.E.E. (Michigan) Physicist, 164S Rugby Rd., Schenectady^, N. Y. 

Burke, H. D., Chief of Inspection and Qual. Control, The Coleman Co Ino., Wichita 1, 
lUinsas. 

Church, Assoc. Prof. Randolph, Ph.D. (Yale) Postgrad. School, U. S. Naval Academy, 
Annapolis, Md , 318 N. Olen Ave,, Annapolis, Maryland. 

Delhi, Douglas George, M.A. (Drake) Statistician, Tuberculosis Control Div , U S, 
Public Health Service, 3896 Porter St, N W, Wash. 18, D C 

Dlmsdele, Bernard, Ph.D. (Minnesota) Instr Purdue U., 4^4 Washinglon Ave., Glencoe, 
III. 

Eaves, James C., M.A. (Kentucky) Instr. Math Dept, of U of N. C., Chapel Hill, N. C 

Elveback, Lillian R., B A. (Minnesota) Instr Biostatistics Dept., School of Public Health, 
Columbia Univ , 600 W. 168th Si., N Y. 38, N. Y. 

Harris, Theodore E., B.A (Texas) Student, Graduate College, Princeton, N. J. 
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Hlrsch, Warren M., B B.A. (New York) Teacher-NYC High School System, i791 Univ 
Ave., Bronx, 'N Y. 

Hughes, Harry M., M A. (Texas) Coat Accountant, Maritime Commission, 1454 Bancroft 
Way, Berkeley 2, Cahf. 

Jaramlllo, Trinidad J., Ph D (Chicago) Research Mathematician, 1947 So Kedste Ave , 
Chicago 23, III, 

Jones, Warren E., B A. (Maryville) Owner and Pres, of Management Controls, 699 Rose 
Ave., Des Plaines, III. 

Kallnowskl, Walbert, Graduate Student in Math, and Statistics, 3689 W. Pine Blvd , St. 
Louis 8, Mo. 

Keeney, Roger D., A.B. fBucknell) Actuarial Clerk, Metropolitan Life Ins Co., N. Y., 
N. Y., 110 Fournier Crescent, East Paterson, N J 

Kelslar, Evan R., Ph D (Califorma) Instr., Princeton U., also Research Assoc. College 
Entrance Exam. Board, Nassau Club, Princeton, N, J 

Keppler, Wharton Fields, B.A, (Ohio State) Math Statistician, M & R Dietetic Lab , 
Inc , S E Long St , Columbus 16, Ohio. 

Kubls, Assoc. Prof. Joseph F., Ph D (Fordham) Dept, of Psychology, Fordham U. Grad. 
School, N. Y., N. Y 

Leepln, Peter, Ph D. (Basle) Actuary-Basler Life Ins. Co., Gellerstr. 6%, Basle, Switzer¬ 
land 

Lefever, Prof. David Welty, Ph D. (S. California) Dept of Education, U. of S Calif., 
University Park, Los Angeles, Calif. 

Likert, Rensis, Ph.D. (Columbia) Head of the Div. of Program Surveys, B.A E Dept, 
of Agriculture, Washington, D C 

Marks, Ell S., Ph.D. (Columbia) Principal Business Economist, OPA Wash., D C , 3711 
Horner Place S E , Washington 20, D C. 

Martin, Prof. William Ted, Ph D (Rlinois) Dept, of Math., Syracuse U., Syracuse 10, 
N. Y 

McGann, Paul Williamson, A.B (Brown) Acting Section Head, Bldg., Material Equip 
Constr Price Div , OPA, 2700 Wisconsin Ave , N. W., Washington 7, D. C 

Michael, William Burton, M S. (S. California) Lecturer in Math. Psychology, Education, 
388 So Oak Ave., Pasadena 8, Calif. 

Muench, Prof. Hugo, Dr P.H. (JH.U ) Dept, of Biostatistics, Harvard School of Pub. 
Health, 55 Shattuok St , Boston 15, Mass 

Murphy, Barbara M., Librarian, of Raytheon Mfg Co , Power Tube Div , Foundry Ave., 
Waltham 54, Mass 

Murray, Janet H., A.M (Stanford) Asst Head-Family Economics Div , Bureau Human 
Nutrition and Home Economics, U. S Dept of Ag , 1025 Connecticut Ave , Washington 
6, D C. 

Nemmers, Frederic E., M.S. (Iowa) Instr , U of Wisconsin, 2936 N. Hackelt Ave , Mil¬ 
waukee 11, Wiscon’sin 

Neurdenhurg, M. G., D.P.H (Amsterdam) Head of the Bureau of Business-Control and 
Statistics of the Municipal Health Dept of Amsterdam and Honorary secretary of the 
General Netherlands Society for Public Health and Social Medicine, Frans Van Mter- 
issiraat 134, Amsterdam Zuid 1, Holland. 

Noel, Roland H., M S (Massachusetts Col. of Pharmacy) Special Asst, to Production 
Mgr Pemcillin Div., Bristol Labs. Inc , Thompson Rd., Syracuse, N Y. 

Nordquist, JohnM.,M.S. (Oklahoma) Research Asst. Seismologioal Lab. 220 N SanRafael 
Ave., Pasadena 2, Calif 1695 Corson St., Pasadena 4, Calif. 

O'Connor, Howard J., M.A. (Toronto) Technical Asst., Development Div. Umon Car¬ 
bide and Carbon Research Labs. Inc., 137-47th St, Niagara Falls, N. Y , 1016 Cleveland 
Ave , Niagara Falls, N. Y. 

Odle, John W., Ph.D. (Michigan) Head, Math. Sec., Research and Development, Naval 
Ordnance Test Station, Inyokern, Calif 
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Pascua, Asst. Prof. Maxcelino, M D, (Madrid) Dopt of Biostatistics, Johns Hopkins 
Univ., 615 N. Wolfe St., Baltimore, 5, Md. 

Perlsteln, Mae, B.A. (Hunter) Teaching Asst. U of Calif., iJfll Durant Ave., Berkeley 4, 
Calif. 

Perrott, Major Ivan Brian, M.A. (Oxford) R. Signals B A.O.R., If Widney Manor fld., 
Solihull, Warmckshire, England, 

Price, Prof. Griffith Baley, Ph.D. (Harvard) Dept, of Math., 206 Frank Strong Hall, U. of 
Kansas, Lawrence, Kansas. 

Reid, David Buchanan William, B A. (McGill U,-Montreal), Graduate Student-atatiatios, 
V.P.I., P. 0. Box 4Bt, Blacksburg, Virginia. 

Reynolds, John Hughes HI, M.A. (U. of the South) Technical Control Statistician, Cela- 
nese Corp of America, Tubize Div , Rome, Georgia. 

Reynolds, William A., M.A (California) Research Associate, National Broadcasting Co., 
30 Rockefeller Plaza, New York 20, N. Y. 

Salerno, John, B A. (Brooklyn) Draftsman (Statistical), BSD Lincoln Ave., Brooklyn 8, 
N. Y. 

Shaw, Byron T., Ph D. (Ohio State) Principal Agronomist, Plant Industry Station, Belts- 
ville, Maryland. 

Shephard, Asst. Prof. Ronald W., Ph D. (California) Dept, of Math., Purdue U., 
Lafayette, Ind. 

Simms, Clifford Raymond, M.S, (Michigan) Conaulting Actuary, 1028 Connecticul Ave., 
N. W,, Washington, D. 0 

Sprengel, Herbert J., MS. (Illinois) Quality Control Engineer, 808 N. Lombard Ave., 
Oak Park, 111. 

Stein, Charles M., B S (Chicago) Student, Columbia U,, 100-SS Colfax St , Si, Albans, 11, 
N. K 

Stlbltz, George R., Ph D. (Cornoll) Consultant in Applied Mathematics, S9S S. Prospect 
St, Burlington, Vermont, 

Stone, John Richard Nicholas, M.A (Cambridge) Director of the Dept, of Applied Eco¬ 
nomics, U. of Cambridge, England, King’s College, Cambridge, England 

Studley, Duane Morton, Associate in Arts (Colorado) Clerk HQ J6th AF, 1311 Cheyenne 
Blvd , Colorado Springs, Colorado. 

Tweedy, Marjorie A. L., B.S. (Ohio State) Economist, Office of Price Adm., 1417 N, St 
N W., Washington, D C 

Updike, Arthur Thomas, Manager, Quality Control Dept. U S. Naval Ordnance Plant, 
Indianapolis 6, Indiana 

Vandlvere, Edgar F., Jr., M A (Duke) Radio Engineer, Technical Information Div., 
Fed. Comm. Comm., Washington, D. C 

Wadman, Alton J., B.S, (Mass Inst of Technology) Chief, Burst Pattern Analysis Section, 
VI Fuge Div , NOL, 8720 Colesville, Rd , Stiver Spring, Md. 

Watkins, Assoc. Prof. John H., Ph .D (Yale) Dept of Public Health, Yale School of Medi¬ 
cine, New Haven, Conn 

Wurtele, Zlvla S., M,A, (California) Assistant in Math, Statistics, Columbia U,, 102 Lex¬ 
ington Ave,, N Y. C 16, N. Y. 

Zwlnggl, Prof. Ernst, Ph D. (Berne) University of Basle, Siibdireotor Easier Life Ins. 
Co,, Kapellenslr 28, Basle, Switzerland. 
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Summary. Several statistical techniques are proposed for economically ana¬ 
lyzing large masses of data by means of punched-card equipment; most of these 
techmques require only a counting sorter. The methods proposed are de¬ 
signed especially for situations where data are inexpensive compared to the 
cost of analysis by means of statistically “efficient” or “most powerful” pro¬ 
cedures. The principal technique is the use of functions of order statistics, 
which we call systematic statistics. 

It is demonstrated that certam order statistics are asymptotically jointly 
distributed according to the normal multivariate law. 

For large samples drawn from normally distributed variables we describe 
and give the efficiencies of rapid methods: 

i) for estimating the mean by using 1, 2, • • ■, 10 suitably chosen order 
statistics; (cf p. 386) 

ii) for estimating the standard deviation by using 2, 4, or 8 suitably chosen 
order statistics; (cf. p. 389) 

iii) for estimating the correlation coefficient whether other parameters of the 
normal bivariate distribution are known or not (three sorting and three 
counting operations are involved) (cf. p. 394). 

The efficiencies of procedures ii) and iii) are compared with the efficiencies of 
other estimates which do not involve sums of squares or products 

1. Introduction. The purpose of this paper is to contribute some results 
concerning the use of order statistics in the statistical analysis of large masses 
of data. The present results deal particularly with estimation when normally 
distributed variables are present. Solutions to all problems considered have 
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been especially designed for use with punched-card equipment although for 
most of the results a counting sorter is adequate. 

Until recently mathematical statisticians hftye spent a great deal of effort 
developing “efficient statistics” and “most powerful tests." This concentration 
of effort has often led to neglect of questions of economy. Indeed some may 
have confused the meaning of technical statistical terms “efficient" and “ef¬ 
ficiency” with the layman’s concept of their meaning. No matter how much 
energetic activity is put into analysis and computation, it seems reasonable to 
inquire whether the output of information is comparable in value to the input 
measured in dollars, man-hours, or otherwi^. Alternatively we may inquire 
whether comparable results could have been obtained by smaller expenditures 
In some fields where statistics is widely used, the collection of large masses of 
data is inexpensive compared to the cost of analysis. Often the value of the 
statistical information gleaned from the sample decreases rapidly as the time 
between collection of data and action on their interpretation increases. Under 
these conditions, it is important to have quick, inexpensive methods for analyzing 
data, because economy demands militate against the use of lengthy, costly 
(even if more precise) statistical methods. A good example of a practical 
alternative is given by the control chart method in the field of industrial quality 
control. The sample range rather than the sample standard deviation is used 
almost invariably in spite of its larger variance. One reason is that, after brief 
training, persons with slight arithmetical knowledge can compute the range 
quickly and accurately, while the more complicated formula for the sample 
standard deviation would create a permanent stumbling block. Largely as a 
result of simplifying and routmizing statistical methods, industry now handles 
large masses of data on production adequately and profitably. Although the 
sample standard deviation can give a statistically more efficient estimate of the 
population standard deviation, if collection of data is inexpensive compared to 
cost of analysis and users can compute a dozen ranges to one standard deviation, 
it is easy to see that economy lies with the less efficient statistic. 

It should not he thought that inefficient statistics are being recommended for 
all situations There are many cases where observations are very expensive, 
and obtaining a few more would entail great delay. Examples of this situation 
arise m agricultural experiments, where it often takes a season to get a set of 
observations, and where each observation is very expensive. In such cases the 
experimenters want to squeeze every drop of information out of their data. 
In these situations inefficient statistics would bo uneconomical, and are not 
recommended. 

A situation that often arises is that data are acquired in the natural course of 
administration of an organization. These data arc filed away rmtil the accumula¬ 
tion becomes mountainous. From time to time questions arise which can be 
answered by reference to the accumulated information. How much of these data 
wiU be used in the construction of say, estimates of parameters, depends on the 
precision desired for the answer. It will however often be less expensive to 
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get the desired precision by increasing the sample size by dipping deeper into 
the stock of data m the files, and using crude techniques of analysis, than to 
attain the required precision by restricting the sample size to the minimum 
necessary for use with “efficient” statistics. 

It wiU often happen m other fields such as educational testing that it is less 
expensive to gather enough data to make the analysis by crude methods suf¬ 
ficiently precise, than to use the m ini mum sample sizes required by more refined 
methods. In some cases, as a result of the type of operation being carried out 
sample sizes are more than adequate for the purposes of estimation and testing 
significance. The experimenters have little interest m milking the last drop of 
information out of their data. Under these circumstances statistical workers 
would be glad to forsake the usual methods of analysis for rapid, inexpensive 
techniques that would offer adequate information^'but for many problems such 
techniques are not available. 

In the present paper several such techniques will be developed. For the 
most part we shall consider statistical methods which are applicable to estimating 
parameters. In a later paper we intend to consider some useful “inefficient” 
tests of significance. 


2. Order statistics. U a sample On = x\, x'l, • • • , *1 of size n is drawn from 
a continuous probability density function /(x). We may rearrange and renumber 
the observations within the sample so that 


( 1 ) 


Xi < Xi < ' ‘ < x„ 


(the occurrence of equalities is not considered because continuity implies zero 
probability for such events). The i.’s are sometimes called order statistics. 
On occasion we write x{i) rather than x,. Throughout this paper the use of 
primes on subscripted x’s indicates that the observations are taken without 
regard to order, while unprimed subscripted x’s indicate that the,observations 
are order statistics satisfying (1). Similarly x(u,) will represent the n,th order 
statistic, while x'(n.) would represent the n.th observation, if the observations 
were numbered in some random order. The notation here is essentially the 
opposite of usual usage, in which attention is called to the order statistics by 
the device of primes or the introduction of a new letter. The present reversal 
of usage seems justified by the viewpoint of the article—that in the problems 
under consideration the use of order statistics is the natural procedure. 

An example of a useful order statistic is the median; when n = 2m 1 (m = 
0, 1, ■ • ■ )i is called the median and may be used to estimate the population 
median, i.e. u defined by 



•/—eo 


dt 


= i 


2 - 


In the case of symmetric distributions, the population mean coincides with u 
and Xm+i will be an unbiased estimate of it as well. When n = 2m (m = 1, 2, 
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• ■ the median is often defined as + x„^t)- The median so defined is 
an unbiased estimate of the population median in the case of symmetric dis- 
tributions; however for most asymmetric distributions ^(x„ + wiU only 
be unbiased asymptotically, that is in the limit as n increases without bound. 
For another definition of the sample median see Jackson [8, 1921], When x is 
distributed according to the normal distribution 


N(x, a, o^) — 


■ ^ ~(l/VJ)(x-o)> 


the variance of the median is well known to tend to ‘Ka^f2n as n increases. 

It is doubtful whether we can accurately credit anyone with the introduction 
of the median. However for some of the results in the theory of order statistics 
it is easier to give credit. In this section we will restrict the discussion to the 
order statistics themselves, as opposed to the general class of statistics, such as 
the range (a;„ — xi), which are derived from order statistics We shah call 
the general class of statistics which are derived from order statistics, and use 
the value ordering (1) m their construction, sysiemahe statistics. 

The large sample distribution of extreme values (examples Xr, for r, s 
fixed and n —> <») has been considered by Tippett [17, 1925] in connection with 
the range of samples drawn from normal populations; by Fisher and Tippett 
[3, 1928] in an attempt to close the gap between the linuting form of the dis¬ 
tribution and results tabled by Tippett [17], by Gumbel [5, 1934] (and in many 
other papers, a large bibliography is available in [6, Gumbel 1939]), who dealt 
with the more general case r > 1, while the others mentioned considered the 
special case of r = 1, and by Smirnoff who considers the general case of Xy , 
in [15, 1935] and also [16] the limiting form of the joint distribution of Xr, x,, 
for r and s fixed as n —oo. 

In the present paper we shall not usually be concerned with the distribution 
of extreme values, but shall rather he considering the limiting form of the joint 
distribution of x(ni), x(n 2 ), • • •, x(?ife), satisfying 

Condition 1. .lim — = ; i = 1,2, ■■■, Ic; 

n'-*ao 71 


\i < Xs < • • • < Xfc . 


In other words the proportion of observations less than or equal to x{n,) tends 
to a fixed proportion which is boumlod away from 0 and 1 as n mcreases. K. 
Pearson [13, 1920] supplies the information necessary to obtain the limiting 
distribution of x{ni), and limiting joint distribution of x{ni), xin^). Smirnoff 
gives more rigorous derivations of the limiting form of the marginal distribution 
of the x(nj) [15, 1935] and the limiting form of the joint distribution of x(n,i) 
and x(nj) [16] under rather general conditions. Kendall [10, 1943, pp. 211-14] 
gives a demonstration leading to the limiting form of the joint distribution. 

Since we will be concerned with statements about the asymptotic properties 
of the distributions of certain statistics, it may be useful to include a short dis- 
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cussion of their implications both practical and theoretical. If we have a 
statistic ^(On) based on a sample 0„: a:i, ■ ,Xn drawn from a population 

with cumulative distribution function F{x) it often happens that the function 
(^ — 9)/£r„ = 2 /n , where o-„ is a function of n is such that 

(A) lim P(?/„ < t) = dx. 

When tjiis condition (A) is satisfied we often say: 6 ts asymptotically normally 
distributed with mean d and variance a%. We will not be m error if we use the 
statement in italics provided we interpret it as synonymous with (A). How¬ 
ever there are some pitfalls which must be avoided In the first place condition 
(A) may be true even if the distribution function of , or of has no moments 
even of fractional orders for any n. Consequently we do not imply by the itali¬ 
cized statement that lim P[d(0„)] = 6 , nor tl>at lim [[E(B^) — [^(0)]**) = 

n —*00 n—*oo 

, for, as mentioned, these expressions need not exist for (A) to be true. In¬ 
deed we shall demonstrate that Condition (A) is satisfied for certain statistics 
even if their distribution functions are as momentless as the startling distribu¬ 
tions constructed by Brown and Tukey [1, 1946]. Of course it may be the case 
that all moments of the distribution of 6 exist and converge as n —«> to the 
moments of a normal distribution with mean 0 and variance . Since this 
implies (A), but not conversely, this is a stronger convergence condition than 

(A) . (See for example J H. Curtiss [2, 1942].) However the important im¬ 
plication of (A) is that for sufficiently large n each percentage point of the 
distribution of d will be as close as we please to the value which we would compute 
from a normal distribution with mean 0 and variance o-“„, independent of whether 
the distribution of d has these moments or not. 

Similarly if we have several statistics 62 , Ok, each depending upon 
the sample 0„ : xi, , • • , x'„, we shall say that the 0, are asymptotically jointly 

normally distributed with means 0,, variances (r\{n), and covariances p,,a-,(r,, when 

lim P{yi <ti,yi < < 2 , ■ ■ ■, Vk < tk) 

n-*oo 

(B) fti rh f‘i 

= K / ■■■ e dxi dx 2 • • • dx/t, 

CO V— OO BO 

where y^ = (0, — 0,)/c,, and is the quadratic form associated with a set of 
k ]omtly normally distributed variables with variances unity and covariances 
Ptj , and iC is a nor m aliz in g constant. Once again the statistics 5,- may not 
have moments or product moments, the point that interests us is that the 
probability that the point with coordinates (^i, 1 ^ 2 , ■ ■ ■, ^k) fs-Hs in a certain 
region in a fc-dimensional space can be given as accurately as we please for 
sufficiently large samples by the right side of (B). 

Smce the practicing statistician is very often really interested in the prob¬ 
ability that a point will fall in a particular region, rather than in the variance 
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or standard deviation of the distribution itself, the concepts of asymptotic 
normality given in (A) and (B) will usually not have unfortunate consequences. 
For example, the practicing statistician will usually be grateful that the sample 
size can be made sufficiently large that the probability of a statistic falling into 
a certain small interval can be made as near unity as he pleases, and will not 
usually be concerned with the fact that, say, the variance of the statistic may 
be unbounded. 

Of course, a very real question may arise; how large must w be so that the 
probability of a statistic falling within a particular interval can be sufficiently 
closely approximated by the asymptotic formulas? If in any particular case 
the sample size must be ridiculously large, asymptotic theory loses much of its 
practical value. However for statistics of the type we shall usually discuss, 
computation has indicated that in many cases the asymptotic theory holds 
very well for quite small samples 

For the demonstration of the joint asymptotic normality of several order 
statistics we shall use the following two lemmas. 

Lemma 1. If a random vaT^able i5(0„) is asymptotically normally distributed 
converging stochastically to 8, and has asymptotic variance (r^(n) —y 0, where n 

n-~*«o 

is the size of the sample On ' xi , X 2 , • • , Xn , drawn from the probability density 
function hix), and g0) is a single-valued function with a nonvanishing continuous 
derivative g'(p) in the neighborhood of d = 6, then g{d) is asymptotically normally 
distributed converging stochastically to g{9) with asymptotic variance <r\[g'i6)f 

Proof, By the conditions of the lemma 


lim P 




1 

■\/ 2ir 



e du. 


Now if <<r„ = Afl, A9 = 0 — 8, using the mean value theorem there is a in 
the interval [fl, ^], such that 


90) = 90) + (^ - 9)g'{e,), 

which implies 

lim P < i) = lim P < i), 9'(ei) ^ 0, 

where is a function of n. However lim g'{0i) = g'{$) so we may write 

lim P (tzJ < A = lio, p ^ \ ^ ^ 0, 

n-*«o \ ffn / r»-*oo \ ffnQ / 


where the form of the expression on the right is the one reqmred to complete 
the proof of the lemma. 

Of course if we have several random variables ■ • •, > we can prove 

by an almost identical argument that 

Lemma 2. If the random variables are asymptotically jointly normally 
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distributed converging stochastically to 6 ,, and have asymptotic variances tr\ (n) 0, 

and covariances pi^Uicr ,, where n is the size of the sample On : , -Ta, ■ •, Xn drawn 

from the probability density function h{x), and gi{ 6 ,), i = 1, 2, • ■, k, are single¬ 
valued functions with nonvanishing continuous derivatives g[0i) in the neighbor¬ 
hood of Bt = 9r, then the gt{ 6 i) are jointly asymptotically normally distributed with 
means g^idt), variances a\[g[idC)f and covariances Pt]<ri<T 3 gti 6 ,)g',{ 6 j) 

The following condition represents restrictions on the probability density 
function f{x) sufficient for the derivation of the limiting form of the joint dis¬ 
tribution of the x{nt) satisfymg Condition 1 

Condition 2. The probability density function f{x) is continuous, and does not 
vanish in the neighborhood of Ui, where 

/ fix) dx = X,, i = 1 , 2 , ■ ,k. 

J—ce 


If we recall the discussion of condition (B) above, the theorem of Pearson 
and Smirnoff may be stated: 

Theorem 1. If a sample On : Xi , Xi, Xn is draum from fix) satisfying 
Condition 2 , and if xiui), xinf) satisfy Condition 1 as n —> °°, then x(ni), xini) 
are asymptotically distributed according to the normal bivariate distribution with 
means ui, uj, 

/ fix) dx = X,, 

J—00 

and variances 


and covariance 


2 _ X.(l - X.) 

' n[/(u.)P ’ 


i = 1 , 2 , 


_ Xi(l - Xa) 
nfiuOfiui)' 


Theorem 1 has an obvious generalization which seems not to have been carried 
out in the literature The generalization may be stated: 

Theorem 2 If a sample 0„ ; xi, Xa , x„ is drawn from fix) satisfying 
Condition 2, and if xini), a:(n 2 ), • • xinu) satisfy Condition 1 os n —> oo, then 
the xin,), i = 1 , 2 , ■■■, k, are asymptotically distributed according to the nor¬ 
mal multivariate distribution, with means u,, 


and variances 



X 


» 1 


X,il - X.) 
nfiu,)^ ’ 


i = 1 , 2 , ,k, 


2 

Oi 
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and covariances 


_ X.(l - X,) 


I < i < j < k. 


m = 


Proof. We shall carry out the demonstration for the uniform distribution 

f 1, 0 < r < 1, 

[o, elsewhere, 

and then utilize the fact that by a suitable transformation of the uniform dis¬ 
tribution we may get any f{x) satisfying Condition 2. Of course for the par¬ 
ticular case of the uniform distribution all moments of the x^ni) exist and con¬ 
verge to those of the asymptotic theory. 

The joint probability density of the x(n,), satisfying Condition 1 and drawn 
from fix), is given by 

n\ 

g[xini), xin,), , a:(n*)] = 


( 2 ) 


(wi - l)l(ra - nj,)l n in, - n,_i - 1)1 

1-2 

O ^a(fll) \ /•! \ n-n* Jb r 


Performing the indicated integrations we get from the right of (2) 


(3) 






where G is the multinomial coefficient on the right of (2). It is well known 

Tli , 71, 

that for the uniform distribution , or asymptotically ~y— 

1, 2, • • •,k. We make the transformation Vn, leading to 


(4) 




- rii-i , Ivi ~ 


+ 


Vi 


n / 


/ n — nk _ 

\ n Vn) 
Using the usual technique of factoring out expressions like 

we rewrite (4) with Uj as a new constant, and setting Xi = — 


(6) 


0 


TT A . Y «-"«-i-Y _ Vk Y~* 

\ (X* ~ Xt-d-v/n/ \ (1 ~ ^k)Vn/ 
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Nowtakmg the logarithm of (5), expanding, neglecting terms higher, 

collecting terms and taking the antilogarithm we get the approximate asymp¬ 
totic distribution of the order statistics 


( 6 ) 


g(x(ni), xini), • • •, a:(n*)) 


= Cz exp 



_ X,-i 

(X^+i — Xi)(X» — Xf-i) 



V 

Xi - x._./J’ 


where^Xo = 0, Xw-i = 1- Now setting up the matrix of the coefficients of the 
quadratic expression in the exponent 


A = Xt+i — X.-i 

(X.V1 - X.)(\f - X._i) ’ 

t = 1,2, • ■ • , fc; ri., = 0, 11 - y I > 1. 

iances we need 


— 1 ri,—1,1 


Xi — X<—1 ’ 


To obtain the variances and covar- 


_ cofactor of .<1.-, in || ri.,, || 
determinant A,/ 


(see for example Wilke [18, p. 63 et seq.]). Now 


(7) 


*+i 1 

IA I = determinant Ay = H r-r— ; 

1 X, “ Xi-i 


cofactor of An = X.(l — X,) I A |, i = 


cofactor of A,-,- 


X.d - X,) I A I, 
XKl - XO 1 A 1, 


1 , 2 , 
i <j 
j < i. 


k. 


This completes the proof for the uniform distribution. 

If the uniform distribution is transformed into a probability density function 
f{x) satisfying Condition 2, by an order preserving transformation, we appeal 
to Lemma 2. We notice that the xin,) are transformed into y[a:(n,)], and that 
the probability that a:(n.) falls in the interval [m, , u, Auj] is transformed into 
the probability that g[x(n,)] falls m the interval [g{u,), g{ui -|- Aui)]. Using 
the mean value theorem we may write 

g{Ui -f Au.) = g(u{) -f Au,g'(u',), 
where u', lies in the interval [u,, u, -|- Au<]. However 

lim g'(ui) = 

The density for the uniform distribution in the interval [w,, Ui -f AuJ is just 
AW(, and this same density will tend to f{u,)Auig'{ui). Therefore g'iui) = 
l//(ui), which completes the proof of Theorem 2. 

It would often be useful to know the small sample distribution of the order 
statistics, particularly in the case where the sample is drawn from a normal. 
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Fisher and Yates’ tables [4] give the expected values of the order statistics up 
to samples of size 50 However it would be very useful in the development 
of certain small sample statistics to have further information. It is perhaps too 
much to expect tabulated distribution functions, but at least the variances 
and covariances would be useful. A joint effort has resulted in the calculation 
for samples n = 2, 3, ■ •, 10 of the expected values to five decimal places, 
the variances to four decimal places, and the covariances to nearly two decimal 
places. It is expected that these tables ivill be published shortly. 

3. Estimates of the mean of a normal distribution. It will be important 
in what follows to define efficiency and to indicate its interpretation. Then we 
shall construct some estimates of the means of certain distributions and compute 
their efficiencies. Except for the tables given, the discussion is applicable to 
the estimation of the mean of any symmetric distribution; and, of course, the 
concept of efficiency is still more general in its application. A statistic ^(0„), 
where On is the sample, is said to be an efficient estimate of 6 if 

i) \/n 0 — B) IS asymptotically normally distributed with zero mean and 
finite variance, and 

ii) for any other statistic S’ with s/niS' — B) asymptotically normally dis¬ 
tributed with zero mean and variance ciS'), 

The ratio c{S)/<!^{S') is termed the efficiency of S' if S is an efficient estimate 
of d. For discussion see Wilks [18, 1943]. The concepts of efficient statistic 
or estimate and of efficiency were introduced by R. A. Fisher They serve as 
one measure of the amount of information a statistic draws from a sample. 
It is also common practice to speak of relative efficiencies, for example, of the 
statistics S' and S" described in ii) above, we say if o-^iS') < ir^{S") that the 
efficiency of S" relative to S' is the ratio of the smaller variance to the larger. 
This concept of efficiency has sometimes been used when the normality assump¬ 
tion has been violated by one or both statistics, when one or both are biased, 
and when small samples are considered. When used under these conditions 
the concept of efficiency becomes more difficult to interpret, although a compari¬ 
son of the variation of two statistics about the value they are commonly esti¬ 
mating is often of value 

In the case of estimates of the mean a of a variable which is normaOy dis¬ 
tributed according to N(x, a, from a sample of n, we can often express the 
variance of an asymptotically unbiased estimate as cr^(,S^) = The sample 

mean S = 'Six[fn is an efficient estimate of a with variance a^/n. Then in such 
cases the efficiency of S^ in estimating a is l/fc,. The interpretation is merely 
that to obtain the same precision using St as is possible with S, one must use 
a sample k, times as large 

Bearing in mind that we are at present searching for economical methods 
for analyzing large samples, it is clear that the concept of efficiency offers us a 
practical way of comparing cost of information with cost of obtaining it. 
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In the present section and in sections 4 and 5 we shall develop certain sys¬ 
tematic estimates of parameters of normally distributed variables. Our pro¬ 
cedure then will be to compare the efficiency of the systematic estimates with 
the efficient statistic for estimating the parameter in question, and also in sec¬ 
tions 4 and 5 we compare our estimates with a statistic not involving squares 
or products. Of course the efficient statistic for estimating the mean of a normal 
is the sample mean, therefore in this section we will only compare our estimates 
with the sample mean. 

We can construct unbiased estimates of the mean of a normal distribution 
from linear combinations of suitably chosen order statistics. These systematic 
statistics wdl be asymptotically normally distributed if the order statistics 
from which they are derived satisfy Condition 1. We will restrict ourselves to 
a useful practical case where equal weights are used. In other words the esti¬ 
mate discussed is just the average of k order statistics k~^'2x{n,). Suppose 
x{ni), i = 1, 2, •••, k satisfy Condition 1, that i?[x(nv)] = i/[a:(ni:_,+i)], so that 
£![2a:(7i,)] = a. An important unsolved question is to discover what spacing 
of the x(nt) will yield minimum variance, and thereafter at what rate does the 
efficiency of this optimumly spaced estimate increase with fc. Computational 
methods bog down rapidly after & = 3. Because so little is known about this 
problem it seems worthwhile to offer some results for three arbitrary spacings 
(these results are of course useful in analyzmg data). 

If the «(«,) satisfy Theorem 2 we may approximate the variance cf the sys¬ 
tematic statistic h == Sa:(n,)/A: by the usual formula 

(8) <r“(4) = E[Mn.)/kf - [EiXx{n:)/k)]\ 


We lose no generality by assuming the mean and variance of the underlying 
normal to be 0 and 1 respectively Then using the fact that Su, = 0, and 
the result of Theorem 1 we rewrite (8) as 


(9) (f^0k) = jE[2(x(n0 - u,)/kf 


1 r * 

= — y 

k^n l‘^i 


^i(l ^i) I 2 ^ 


- X, ) 




fJ, 


iT 


where/m = /(u™). 

Using the symmetry which makes X, = 1 — X*_,>i, /, = A-i+i, and the fact 
that for & = 2r -j- 1, fr+i = 1/-s/2t, X^+i = we may simplify the right side 
of equation (9) with the following results for fc = 1,2, ■ ■ ■, 7. The factor 1/fc* 
has not been disturbed. We also write the general formulas for the simplified 
form of (9), but we omit a rather lengthy combinatorial argument which es¬ 
tablishes the generalization 


fc = 1: 


TT 

2n 


fc = 2: 


2Xi 

4n/? 
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—3 • ^ r 


9« L/i 


h 


+ 


a 


( 10 ) 


fc = 4: 


k = 5-. 


/fc = 6: 


k = 7: 




1 /l/j 


2 Xi Xa' 


16n Ul 

2 rXi 2 Xi Xa 1 1 ^”1 

2 -s;li!+w.+ 7 ! + '^( 7 > + 7 .) + iJ 

2 r^i , 1 , 2 Xi 2 Xi 2 Xa ”1 

^ L /1 /. /i/» /i/. ^ /«/* J 


36n 

' Sxh ' /i/. 


1 Xa , Xj 2 Xi 2 Xi 

2 + 72 + 72 + r^ + j‘^ + 


^a 
Si Si 


+ 


-^(7‘ + S + )i) + '4] 


k = 2r: 


-A- r y 4.2 y — 


r > 1 


fc = 2r + 1: 


2 r ( 2 r)’ 

(2r 4- l)»n L 2 


4r) + '\/27r + 

i -1 u 


?]■ 


r > 1. 


In addition to the possibility of minimizing the equations of (10) by numerical 
methods, three other procedures suggest themselves: i) to space the order 
statistics uniformly in probability; ii) to choose those k order statistics whose 
expected values are equal to the expected values of the order statistics in a 
sample of size fc drawn from a unit normal; iu) to choose X, = (i — i)/fc. The 
following table lists for fc = 1, 2, and 3 the expected values u, of the order sta¬ 
tistics and the probability to the left of the expected values X< for each of the 
procedures. The chosen order statistics are counted from left to right. It 
will be noticed that the third method gives very good results, and has the value 
of simplicity of formula. The following table gives a comparison between the 
efficiencies resulting from spacing by the three methods. The three optimum 
cases are included for completeness. 

Statisticians planning to use the method of expected values suggested above 
will find Fisher and Yates [4, 1943] table of the expected values of the order 
statistics in samples of size fc drawn from a unit normal helpful for computing 
the X,. Alternatively the following table of X; might be used. 

As an example of the use of Table III, suppose we are using the expected 
value method for estimating the mean of a large sample drawn from a normal 
distribution N{x, a, cr“). If we are willing to use 6 observations out of 1000 for 
this purpose Table III indicates the selection of Xwa, Saai, xui, i Xxw , xaes ■ 
Furthermore Table II indicate^ that the variance of the estimate of a based 
on the average of these six observations will be approximaliely ( 7 */.94871, n = 1000. 
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4. Estimates of the standard deviation. The statistic 

s® = £ (x[ - xf/in - 1), 

1-1 


TABLE I 


Comparison of the order statistics which would he chosen according to each of the 
four procedures for suhsamples of ic = 1,2,3 


k 

Order 

Statistic 

Optimum. 

Equal 

Probability 

Expected Values 

x,=(t- 

m 

Ut 

X. 

u. 

X. 

It. 

X. 

u. 

X( 

1 

First 









2 

First 

-.6121 



.3333 

-.5642 

.2863 

-.6745 



Second 

.6121 

.7298 


.6667 

.5642 

.7137 

.6746 

|B 

3 

First 


.1826 

-.6745 


-.8463 

.1967 

-.9674 

.1667 


Second 

liaaa! 









Third 

.9066 

.8174 

.6745 


.8463 


.9674 

.8333 


TABLE II 

Cornparison of the efficiencies of four methods of spacing k order statistics used 
m the construction of an estimate of the mean 


k 

X. = t/(fc-t-l) 

Expected 

Values* 

X,=»(i—i)/fc 

Optimum 

1 

.637 

.637 

.637 

.637 

2 

.793 

.809 

.808 

.810 

3 

.860 

.878 

.878 

.879 

4 

.896 

.914 

913 


5 

918 

.933 

934 


6 

.933 

.948 

.948 


7 

.944 

.956 

.957 


8 

.962 

.963 

.963 


9 

.957 

.968 

.969 


10 

.962 

.972 

.973 



* The Wi are chosen equal to the expected values of the order statistics of a sample of 
size k. 


where x = well known to be an unbiased estimate of the popula¬ 

tion variance <r^, for n > 1. However s is not in general an unbiased estimate 
of <r. We are not interested here in the question of when we should estimate a 
and when it is more advantageous to estimate o^. All we want is to have an 
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unbiased estimate of cr, based on sums of squares, to compare with another 
unbiased estimate based on order statistics. In the case of observations drawn 
from a normal distribution 


( 11 ) 


/ = - 11 ) 
r(^n) 



} 


is an unbiased estimate of a- (sec for example Kenney [11], with variance 


( 12 ) 


<r^(s') 


~ r(M« - 1]) 

. r(in) 



-- 1 ) - 1 


2 

or . 


TABLE III* 

P{x < u:ik) X lOS w.i* = E(x^ik), a^iljb is ihe ith order statistic in a sample of 
size h drawn from a normal disinhntion Nix, 0,1) 


X 

1 

2 

3 

4 

5 

6 

■ 

8 

9 

1 










2 

2863 

7137 








3 

1987 


8013 







4 

1516 

3832 

6168 

8484 






5 


3103 

5000 

6897 

8776 





6 



4201 

5799 

7395 

8975 




7 


2244 

3622 


6378 

7756 

9119 



8 


1971 

3182 

4394 

mmm 

6818 

8030 

9227 


9 


1766 

2837 

3919 

hmII 


7163 

8244 

9312 



1684 

2559 

3536 

4612 

5488 

6464 

7441 

7416 


* The table is given to more places than necessary for the purpose suggested because it 
may be of interest in other applications The E(x,\ii) from which the table was derived 
were computed to five decimal places 

For most practical purposes however, when n > 10, the bias in s is negligible. 
For large samples v“(s') approaches <j^/2n. 


4A. The range as an estimate of a. As mentioned in the Introduction, 
section 1, it is now common practice in industry to estimate the standard devia¬ 
tion by means of a multiple of the range J?' = c„(a:„ — a:i), for small samples, 
where c„ = l/[K(j/„) — i/„ and i/i being the greatest and least observations 

drawn from a sample of size n from a normal distribution Niy, a, 1). Although 
we are principally interested in large sample statistics, for the sake of complete¬ 
ness, we shall include a few remarks about the use of the range in small samples. 

Now R' IS an unbiased estimate of o-, and its variance may be computed for 
small samples, see for example Hartley [7, 1942]. In the present case, although 
both R' and s' are unbiased estimates of a, they are not normally distributed. 
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nor are wc considering their asymptotic properties; therefore the previously 
defined concept of efficiency does not apply. We may however use the ratio 
of the variances as an arbitrary measure of the relative precision of the two 
statistics. The following table lists the ratio of the variances of the two sta¬ 
tistics, as well as the variances themselves expressed as a multiple of the popu¬ 
lation variance for samples of size n = 2, 3, ■■■, 10. 


4B. Quasi ranges for estimating a. The fact that the ratio a’‘(s')/(r^(lt') 
falls off in Table IV as n increases makes it reasonable to inquire whether it 
might not be worthwhile to change the systematic estimate slightly by using 
the statistic Ci|„[a:„_i — Xj], or more generally Cri„[a:„_r — Xr+i] where Crin is the 
multiplicative constant which makes the expression an unbiased estimate of a 
(in particular c,|n is the constant to be used when we count in r -h 1 observations 
from each end of a sample of size n, thus Cri„ = l/[E(y„-r — 2/r+i)] where the 

TABLE IV 


Relative precision of s' and R', and their variances expressed as a multiple of rr®, 

the population variance 


n 

aHs'lUKR') 



2 

1.000 

.570 

.570 

3 

.990 

.273 

.276 

4 

.977 

.178 

.182 

5 

.962 

.132 

.137 

0 

.932 

.104 

.112 

7 

.910 

.0864 

.0949 

8 

.889 

.0738 

.0830 

9 

.869 

.0643 

.0740 

10 

.851 

.0570 

.0670 


y’s are drawn from N{y, a, 1)). This is certainly the case for large values of n, 
but with the aid of the unpublished tables mentioned at the close of section 2, 
we can say that it seems not to be advantageous to use ci|„[x„_i — ccj] for n < 10. 
Indeed the variance ciiio[a ;9 — xi], for the unit normal seems to be about 10, 
as compared with <r*(E')/cr* = .067 as given by Table IV, for n = 10. The 
uncertainty in the above statements is due to a question of significant figures. 

Considerations which suggest constructing a statistic based on the difference 
of two order statistics which are not extreme values in small samples, weigh 
even more heavily m large samples. A reasonable estimate of e for normal 
distiibutions, which could be calculated rapidly by means of punched-card 
equipment is 


[x{rh) - x(ni)], 


(13) 


<r = 
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where the x{ni) satisfy Condition 1, and where c = 112 — Ui, Ut and ui are the 
expected values of the and ni order statistics of a sample of size n drawn from 
a unit normal. Without loss of generality we shall assume the xi are drawn 


from a unit normal. Furthermore we let — = Aj = 1 — Xi = 1 — —. Of 

n n 

course a will be asymptotically normally distributed, with variance 




=—r 

nc® _ 


Aid - Xi) ^ X,(l - A’) 


2\i(l - A,) 




( 14 ) ^21 

Because of symmetry /(ui) = /(iia); using this and the fact that Ai = 1 — Aj, 
we can reduce (14) to 


(15) 


_ 2 Xi(l - 2Ai) 
nc« [/(wi)l^ * 


We are interested in optimum spacing in the minimum variance sense. The 
minimum for occurs when Ai * .0694, and for that value of Ai, ff*(a-) ^ 
.767 v“/n. Asymptotically a' is also normally distributed, with cr“(fi') = vV2n. 
Therefore we may speak of the efficiency of ^ as an estimate of a as .662. It is 
useful to know that the graph of is very flat in the neighborhood of the 
minimum, and therefore varying Ai by .01 dr .02 will make little difference m 
the efficiency of the estimate S' (providing of course that c is appropriately 
adjusted). K. Pearson [13] suggested this estimate in 1920. It is amazing that 
■with punched-card equipment available it is practically never used when the 
appropriate conditions described in the Introduction are present. 

The occasionally used semi-interquartile range, defined by Ai = .25 has an 
efficiency of only .37 and an efficiency relative to 3- of only .56. 

As in the case of the estimate of the mean by systematic statistics, it is per¬ 
tinent to inquire what advantage may be gained by using more order statistics 
in the construction of the estimate of <r. If we construct an estimate based on 
four order statistics, and then minimize the variance, it is clear that the extreme 
pair of observations will be pushed still further out into the tails of the dis¬ 
tribution. This is unsatisfactory from two pdints of -view in practice: i) we will 
not actually have an infinite number of observations, therefore the approxima¬ 
tion concerning the normality of the order statistics may not be adequate if Ai 
is too small, even in the presence of truly normal data; ii) the distribution 
functions met in practice often do not satisfy the required assumption of norm¬ 
ality, although over the central portion of the function containing most of the 
probability, say except for the 6% in each tail normality may be a good approxi¬ 
mation. In view of these two points it seems preferable to change the question 
slightly and ask what advantage will accrue from holding two observations at 
the optimum values just discussed (say Xj. = .07, = .93) and introducing 

two additional observations more centrally located. 

We define a new statistic 


( 16 ) 


[a:(n 4 ) -f- ^(nj) — x(r^) — a:(ni)], 
c 
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c' = E[x{ni) + x{ns) — x{n^ — x{n^\ wliere the observations are dravm from 
a unit normal. We take = 1 — X 4 , Xa = 1 — Xj, Xi = .07. It turns out that 
a {a') la mimmized for Xs in the neighborhood of .20, and that the eflEiciency com¬ 
pared with s' is a little more than .75. Thus an increase of two observations in 
the construction of our estimate of a increases the efficiency from .65 to .76. 
We get practically the same result for .16 < Xa < .22. 

Furthermore, it turns out that using Xi = .02, Xa = .08, X 3 = .15, X* = .25, 
Xb = .75, Xb = .85, X: = .92, Xs = .98, one can get an estimate of <r based on 
eight order statistics which has an efficiency of .896. This estimate is more 
efficient than either the mean deviation about the mean or median for esti¬ 
mating (T The estimate is of course 

c" = [a:(ng) + x{ni) - 1 - x{nt) -f x{ns) — a:(n 4 ) — xM — x{n 2 ) — z(ni)]/C, 
where C = 10.34. 

To summarize: in estimating the standard deviation a- of a normal distribution 
from a large sample of size n, an unbiased estimate of a is 

C 


where c = E{y— Vf) where the y’s are drawn from Niy^ a, 1). The estimate 
^ is asymptotically normally distributed with variance 


a/.,. _ 2 Xi(l — 2 X 1 ) 
nc* ’ 

where Xi = r/n, f{ui) = N{E{Xr), 0, <r*). We minimize o-’(a-) for large samples 
when Xi — .0694, and for that value of Xj, 




.7Q7<r^ 

n 


The unbiased estimate of <r 

<J — — (Xn—r-t-i “f“ Xa— f-}-l ~ 3^5 3V) 

C 

may be used in lieu of v. If Xi = r/n, Xa = s/n we find 

c\o' 1 Xi = .07, Xj = .20) =:= . 

n 

4C. The mean deviations about the mean and median. The next level of 

computational difficulty we might consider for the construction of an estimate 
of IT is the process of addition. The mean deviation about the mean is a well 
known, but not often used statistic. It is defined by 

n 

m.d. = ^ I — S \/n. 

t—1 


( 17 ) 
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For large samples from a normal distribution the expected value of m.d. is 
'o 

- (T, therefore to obtain an unbiased estimate of <r we define the new statistic 




= vl”-' 


d. Now for large samples A has variance — 2)]/n, or an 


efficiency of .884. However there are slight awkwardnesses in the computation 
of A which the mean deviation about the median does not have. 

It turns out that for samples of size a = 2jn + 1 drawn from a normal dis¬ 
tribution N {y, a, 0) the statistic 

/tT 23 I *1 ~ ®Fn+l 1 


(18) 


Jlf' 




2 m 


asymptotically has mean o- and variance 


Thus in estimating the standard deviation of a normal distribution from 
large samples we can get an efficiency of .65 by the judicious selection of two 
observations from the sample, an efficiency of .75 by using four observations, 
and an efficiency of .88 by using the mean deviation of all the observations from 
either the mean or the median of the sample, and an efficiency of .90 by using 
eight order statistics. 


5. Estimation of the correlatioi] coefficient. In the present section we con¬ 
sider the estimation of the correlation coefficient of a normal bivariate population: 


( 20 ) 


fix, y) = 


2)ro-*<r*\/l — P* 


exp 


. 2(l-p^)( 


{x - a)» , {y - ly 2p{x - a)(y - b) 


a 

Cm 


+ 


2 

(Ty 


^9 


)]• 


The efficient estimate of p in a sample 0„ ; (x(, j/(), (x(, i/s), • ■ ■, (x(,, y'n) drawn 
from the density (20) is 

J^ix, - x'){y\ - y) 


( 21 ) 


r = 


[23(a;i - - yf]^ 


There are numerous other techniques in the literature for estimating p, among 
them i) the tetrachoric correlation coefficient which depends on a four-fold table, 
ii) the adjusted rank correlation coefficient which depends on assigning ranks to 
the X and y observations. These and other estimates of the correlation co¬ 
efficient are discussed by Kendall [10]. 

We shall be concerned with the construction of some estimates of the cor¬ 
relation coefficient which are particularly adapted for use with punched-card 
equipment, A counting sorter is adequate for the first two cases discussed; 
in line with our previous development we shall then consider a technique which 
uses simple addition of the observed values, but does not require sums of squares 
or products (in the special case where variances of x and v are emml'' 
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SA. Estiiiiiation of p when means and standard deviations are known. Let 

us suppose that the means and variances of the variables x and y, distributed 
according to (20) are given, and consider the problem of estimating the cor¬ 
relation coefficient p from a sample of size n. There will be no generality lost 
by assuming a = b = 0, ai = al = 1. The technique used will be to construct 
lines 2 / = 0, a: = dbfc, which cut the a:j/-plane into six parts. We will form an 
estimate of p based upon the number of observations falling in the four comers. 
Figure 1 represents the lines laid out in the manner suggested in connection with 
a scatter diagram of 25 observations; naturally the method is recommended for 



xaa-kcr,^ * = a x=*+lco-j 

Fig. 1 Diagram of the Construction Described in Paragraph 6A with a Sample op 

25 Observations Superimposed 

use only with large samples, the 25 observations are for purposes of illustration 
only. More specifically after assigning the special values mentioned immedi¬ 
ately above to the means and variances in (20), we define 



*g0 ^eg 

Pi = fix, y) dx dy, 

Jo J]6 

Pb = f f fix, y) dx dy, 

J—oe *^00 

(22) 

jh = [ f fix, y) dx dy, 

Jo J-» 

P4= f f fix,y)dxdy, 
J—06 Jjfc 


Pb = f f fix, y) dx dy = 
J— 06 J— Jb 

j^Nix, 0,1) dx. 
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We denote by n, the number of observations falling into the region containing 
probability density p,. Of course = n. Now we may write the joint 

probability distribution of the n, as 

(23) gini, Mj, Ws, rn) = ‘ • 

4 

remembering that rii — n — 2 • 

1 


We shall now derive the maximum likelihood estimate of p from (23). Taking 
the logarithm of (23) we have 

6 

(24) log = log c + log p., 

1-1 


where c is the multinomial coefficient on the right of (23). 
with respect to p gives 


(25) 


d(logg) ^ ynip. 

dp ^ Pi 


Differentiating (24) 


where ^ 

dp 


c dps 

of course -f- 
dp 


0 because p6 is functionally independent of p. 


To get p, the maximum likelihood estimate of p, under our restrictions, we must 
equate the right of (25) to zero and solve for p. Before proceeding it will be 
useful to note the following relations: 


Pi = P8 ; P2 = P4 

(26) Pi = -pi ; pj = -Pi iPi = Pi ;pi = pi 

+ P 4 = N{x, 0, l)dx = X; P 2 + p 3 = jf N{x, 0, l)dx = X. 


Pi 


If after making appropriate substitutions from (26) we set the right of (25) 
equal to zero we get 

nipi _ Thpi wapi _ w«pi _ Q 

Pi X - pi pi X — Pi ’ 
and since in general pi ^ 0, the condition is that 


(27) ni + nj _ Pi 

W 2 4" R* X — Pi 

Unless all four of the Ui are zero (which is unlikely for reasonable values of X 
because n is large), it is possible to find a value of p which will make the right 
side of (27) equal to the ratio formed from the observations on the left, and 
the value of p so determmed is the maximum likelihood estimate p under the 
restrictions we have imposed, In practice this equation may be solved by con- 
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suiting a table of the bivariate normal distribution—see for example K. Pe^-rson 
[14]. Alternatively [27] may be solved by referring to Figure 3. Truman Kelley 
[9, 1939] has considered a closely related problem in connection with the valida¬ 
tion of test items. 

It may be inquired w'hether it would not be preferable to reduce the present 
design to a tetrachoric case by using only the cutting lines a; = 0, y = 0. An 
investigation of the variance of p reveals that such is not the case. We proceed 
to determine the asymptotic variance by means of the usual maximum likelihood 
technique. Differentiating (25) once more we have 

^ 28 ) q ) ^ - pb 

dp^ ^ Pi ’ 

where pi = . We note that F(n,) = np., therefore 

dp^ 


(29) 



but since the derivative of a sum is equal to the sum of its derivatives, and 
Pi + p* = X, Pj + Pa = X) the first sum in the square brackets vanishes. Suit¬ 
able substitutions from (26) will reduce the second sum so that we get 


(30) 


^r d°(Iogy) 1 ^ 2npiX 

L dp^ J pi(x - pi) ■ 


Therefore asymptotically p is normally distributed with variance 


(31) 


^ Pi(X - pi) ^ 
~ 2nXpi 


In general the optimum value (in the minimum variance sense) of X which deter¬ 
mines the cuttmg lines x = ±fc ivill depend on the true value of p. To carry 
out the min i mi zation process in general will require fairly extensive computa¬ 
tions, which we feel would be justified. For the present we shall restrict our¬ 
selves to minimizing <r’^(p) for the case p = 0. 

We have 

p, =^exp[-ifc^]= ^/(fc). 

when p = 0, and pi = 2 X. This gives 

v (p I P = 0) = ■ 

We wish to minimize the expression on the right. We recall that a similar 
expression \i/fl was to be minimized in section 3 when the optimum pair of 
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observations for estimating the mean of a normal distribution was found. 
Using the previous results we have X = .2702, k ~ .6121; which gives us finally 

(33) = 0) = — . 

n 

To summarize: if a sample of size n is drawn from a normal bivariate popula¬ 
tion with known means a*, a, and variances er* and vj, but unknown correlation p, the 
maximum likelihood estimate of p based on the number of observations falling in 
the four comers of the plane determined by the lines x = a, d: kox ,y = a^ is found 
by solving for p the equation 

_ ni Us _ Pj 

ni 4- na -f «3 + »< T * 

where n^ is the number of observations falling in the upper right, na in the upper 
left, ni in the lower left, jii in the lower right hand comer, and p, is the probability 
density in the region into which the Ui fall, X == pi + . The variance of this 

estimate p is given by 

which is minimized for p = Oby selling k — .6121, X = .2702, giving 


Oajtl (p 1 P — 0) 


1.939 

n 


On the other hand if the usual tetrachoric estimate is used with x = 0, p = 0 
as the cutting lines we get (T%{p | p = 0) = r/in. The relative efficiency of 
the tetrachoric compared with the optimum statistic is therefore .787. The 
variance of the efficient estimate r givep in (25) when p = 0 is l/». Consequently 
the efficiency of our estimate p compared to that of r is about .615 for the special 
case p = 0 under consideration. This means about twice as large a sample is 
reqmred to get the same precision with p as with r. Doubling the sample and 
using the cruder statistic p may often be an economical procedure. 

It may be surmised that a still better estimate of p could be constructed by 
employing four cutting lines, say x = ±fc, y = The simplifications which 

we used to obtain the estimate p no longer hold when we use this new construc¬ 
tion. However, it is still possible to compute the minimum variance of the 
new estimate which we will call p', for the special case p = 0. It again turns 
out that k = .6121 minimizes and we get 


(34) 


2 /-/ I 1.52 

Vopt(P 1 P = 0) =*= - , 

n 
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which makes the efficiency of p' (compared with r) about .66 as compared with 
515 for p. This suggests that if some very simple teehnique can be found for 
obtaining p, p' would be worth using. Uniortunately the author has not 
been able to construct a rapid way of finding p'. 


5B. Estimation of p when the parameters are unknown. A more practical 
situation than the case treated in paragraph 5A, is the case in which all param¬ 
eters of (20) are unknown. This case will be treated by means of order sta¬ 
tistics. We construct an order statistic analogue of the estimate p which we 
will call p*. In general the procedure will be as follows: Each of the A observa¬ 
tions in the sample has an x coordinate and a y coordinate 

i) order the observations with respect to the x coordinate; 

li) discard all observations except the n with the largest x coordinates called 
the right set and the n with the smallest x coordinates called the left set, retain¬ 
ing, therefore, 2n observations, 

iii) order the pooled 27i observations with respect to the y coordinate; 

iv) break the 2n observations into two sets of n observations each; the upper 
set containing the n observations with the greatest y coordinates, and the 
lower set containing the n observations with the smallest y coordinates, 

v) reorder the upper set of observations with respect to the x coordinate; the 
n observations will be divided mto those whose x coordinates belong to the 
right set and those whose x coordinates belong to the left set, 

vi) the estimate p* will be obtained by solving the equation 


(35) 




where n* is the number of observations in the upper set which are also numbers 

of the right set and p* is / f(x, y)dx dy, while /(x, y) is the bivariate 

Jo Jk* 

r n * 

normal (20) with o-i; = cr„ , = 1, o = 6 = 0, and / N(x, 0, 1) dx = — = Xi . 

J/fc* A 


Figure 2 represents graphically the construction described above for a scatter 
diagram composed of 25 observations. Of course the number 25 is only for 
purposes of illustration, as the method is only proposed for use withlarge samples. 

The procedure of ordering the x’s and choosing the right and left sets of ob¬ 
servations is analogous to cutting the bivariate distribution by the two lines 
X = dh/c as described m paragraph 5A, indeed x = Xn+i and x = x^-n are the 
corresponding lines, but they vary from sample to sample. To continue the 
analogy, ordering the remaining observations with respect to y and dividing 
them into upper and lower sets of equal size is like cutting the plane with the 
line y = 0. Finally formula (35) is analogous to formula (27) Another similar 
change is that where formerly we had among relations (26) the equalities pi = 
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Pi) P 2 = , we now have the corresponding relations^amonpt the number of 

observations in the four corners of the plane, namely ni = nj , n* = n’t which 


0 

P 

1 

0 

0 

0 

" 0 “ 

0 

0 • 

0 n«n|%nj = e 

7/777/777/7//7 

77776777777777 

7/77777777/7 

0 

0 

0 

0 

0 

0 

0 

n/ssH-Sn-a 

0 

0 


n»n,*fn| sfl x,»x 1 i,kX n■■ni*+n4'* 

Fio. 2, Diaobam of the Construction Described in Pabaqbaph 6B on the Basis op 26 

Observations n = 6 

can readily be seen by inspection of the fourfold table we have constructed below 
(omitting all reference to iV — 27i pairs of observations we have discarded). 



Left set 

Right set 

Totals 

Upper set. 

♦ 

712 

♦ 1 

Til 1 

n 

Lower set. 

* 

713 

Ui 1 

1 

71 

Totals. , , 

1 

71 ' 

n 

2n 


We have dwelt at length upon the analogy between the two constructions 
because one of the principal difficulties in working with order statistics is to 
design a mathematically workable model. The author has found it fruitful 
when constructing systematic statistics to study a workable analogy which does 
not involve the order statistics directly, and then to build upon correspondences 
such as those described. 
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Some may not wish to read further in this paragraph when they are informed 
that asymptotically the variance of p* is essentially the same as that of p. They 
should proceed to page 404. For the others we proceed to the demonstration. 

Suppose we draw a sample of N pairs of observations (a:,, y'i) from the bi¬ 
variate normal (20). If we discard from these all pairs except those with the 
n largest x, and the n smallest X{, we are left with the right set and the left set. 
We shall need the joint distribution of a:„ 4 i and 

(36) - 2„ - 2)1 (I. 

a *W-n \ AT—2n—2 / |."0 \ n 

g{x)dx\ U g{x)dx) g{x„+i))g{x!f^„). 

where g(a;) is the marginal distribution of a; obtained from (20), N{x, a, al). 
We assume x„+i, a:v_n satisfy Condition 1. Considering a:n 4 i, Xtr-n as fixed 
and given for the moment we wish to look at the distribution of the y coordinates. 
We may consider the y coordinates of the observations in the right set as drawn 
from the distribution of y 


<p'iv) 


f fix, y) dx 

[ fix, y) dx 

-03 ACO 

/ / fix, y) dx dy 

•^-00 

/ gix) dx 


Similarly the y coordinates of the observations belonging to the left set maybe 
considered as indepehdently drawn from 




1 

fix, y) dx 1 

-00 •f- 

fix, y) dx 

'DO 


r ®n+l 

fix, y) dx dy 
’—00 ' 

1 gix) dx 

v—00 


To prevent confusion, in considering the y order statistics of the two sets, we 
shall designate those of the observations which are members of the right set 
by Ml, 112 , • •, ; while those observations belonging to the left set will have 

their ordered y coordinates designated vi,Vi,---,Vn. Of course the it’s and v’s 
separately satisfy an order relation like that given in (1). 

The first question we answer is: given x„+i, x^-n, what is the probability 
that when we collate the u’s and r’s and split the observations into the upper 
set and lower set (see iv). there will be exactly c observations in the lower set 
whose y coordinates are designated by u’s? In other words what is the prob¬ 
ability that exactly c members of the lower set belong to the right set? An 
example for small values of n may clarify the problem. Suppose n = 4,‘ and 
we observe u\ < vi < < Vi < < Ui •, the y coordinates of 

the lower set of observations are ui, i>i, i> 2 , Va, and only the observation with ui 
for its y coordinate belongs to the right set, so for this case c = 1. To return 
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to our general problem, the probability that there are exactly c observations 
which, are members of both the right set and the lower set is 

(37) P(c xy-,) - 1 - piK-c > u^+i) - p(Uc > 

where p{iv > z) is the probability that w is greater than z. Now writing <p{z) = 
f <p'(i)di, if/iz) = f we may rewrite (37) as 

J—tC J—tO 

Pi^C I .Tn-f-lj u) 

-ni f” («„-.)IV'(a,-.) 

(38) c!(n — c — 1)1 




- ^ - 1)1 rj dv„. 


•C41 


After integrating the first integral of (38) by parts and simplifying we can rewrite 
(38) as 

n' 


(39) 


P(c I a:„+i, Xy-fl) = ^ [lA(wm)]"~'[l — l('(*<c+i)]' 


+ 


w! 


- Dll 


f ('‘o + ll 


a”~'(l - a)'-^ da. 


(n — c)!(c — l)t 
We approximate the integral term of (39) by 

( ~ n- ~ )l |r :: iT![^(^°-*-») ~ - iA(wc+i)]'-‘ 

which leads us to the approximation 

P(C I ^n+l I ^X—Ti) 

(40) n\ 

[Huc+i)r'^ii - ,^(Mc+i)r'[i + (c - - v(w.)]. 


(n —c)lc! 

The joint distribution of Uc , u,+i is given by 
Q(Uc , ac.|.l j Xy^D 


(41) 


n' 




(c — 1)! (n — c — 1) 

Next we multiply P as given by (40) by Q from (4l) and integrate out Us. This 
gives us except for terms of 0 ^ and higher 


nln! 


[<p(KR)r 


(42) c!(7i — c — l)!c!(n — c)! 

• [1 <£)(m,+i)]"'^''V(Wc+i)]’'''[1 ^(Uc+l)Y<p'{Uc+l). 
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Wiien expression (42) is multiplied by (36), we finally get the approximate joint 
distribution of c, Xn+i, Xtt-n , «c+i. 

Before proceeding further we let 


V>(Me+l) = 


£ M«+1 

/ /(*. y) dx dy 


(43) 


where 


1 - X? 

■Uii+l 


Ik 

Vi 


1 - 'hs 


lA(«<i+l) = 


i« 


XJ 

r*jir-n 


Xf 


+ /•*«+! * * 

^ 1 = 3 (x)dx, Xa = I gix)dx. If we also let pi = 1 — 


~ P* ) Pi = Xi — Pa we can write 

12 (c, i llc+l) 


(44) 


“ A(A2 Ai) Pi P2 Ps p4 p4 Al A2 ) 


where the primes indicate derivatives of p* , X* , X* with respect to the appro¬ 
priate suppressed variables, Uc+i , x„+i, x^-n, respectively. 

We now proceed to the maximum likehhood estimate of p. We take the log¬ 
arithm of (44) and then take partial derivatives with respect to a the mean of 
X, h the mean of y, and p the correlation coefficient. After equating these 
partial derivatives to zero we have the following three maximum likelihood equa¬ 
tions wMch must be solved simultaneously to obtain the estimates d*, h*, and jo*: 


(45) 


(46) 


(47) 


j;_ ■ 
N _ 


N -2n-2 3(X* - Xi) n - c - 1 3^* 
X* — Xf 3o ^ da 


1 Pw 

Ni 

Nl 


, _c_ .n — c 
p* da 


pi 


4- = n 

da pf aaj 


— C — 1 dpt I ^ d^ VlZLS 

p* db p* db pf 


— c — 1 d^ . 

r\ f 


pf 


dp p* dp 


c dpi — c 


PT 


^ 4- = 0 

db pf db J ’ 

dp3 . C ^P4 I Q 

dp pi dp J 


where terms 0 have been neglected. Equations (45) and (46) are satisfied, 

again except for terms 0 , when d* = Kxn+i -f Xn-v), b* = u„^i. Using 

this information we examine (47) and find it satisfied when 

71 — c Pi 

(48) - - = i , 

c Xi - Pi ’ 


which is directly analogous to equation (27), and is the form promised in (35), 
if Tif = n — c. The estimate p* is obtained by solving (48) for p, where pi = 
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J pW fttc 

I I fix, y) dx dy, and f{x, y) is given by (20) with variances equal unity and 

means equal zero, and / gix) dx = = 1 — Xa ~ n/N', 

We shall not go through the derivation of (t’'(p*) here. The usual maximum 
likelihood technique may be used. It turns out that the covariances between 

d* and p* and between h* and p* are 0 • Neglecting such terms we find that 


the variance is 
(49) 


_ Pi(Xi - Pi) 

^ 2N\tpP ■ 


To summarize: if a sample of size N is drawn from a normal bivariate popula¬ 
tion with unknown parameters, the maximum likelihood estimate of p based on the 
2n observations composed of those observations with the n largest x coordinates and 
the n smallest x coordinates, may be obtained by solving for p the equation 


n ~ c _pi 
>?’ 


* r r 

where ^ > X* = n/N > 0, pi = f f(x, y j cti = 1, o* = Oj, = 0) dx dy, 

Jo Jk’ 

/ Nix, 0, 1) dx - and n — c is (he number of the 2n observations with 
Jh’ 

largest y coordinates, which also have largest x coordinates. The variance of this 
estimate p* is given by 

Jf^*\ _ Pi (^i ~ Pi) 

" ^ ~ ' -2N\*pP ’ 

and for p ~ Othe variance is minimized by choosing ~ .2702, that is by choosing 
that 27 per cent of the observations with largest x coordinates, and that 27 per cent 
with smallest x coordinates, and for this value of X* 


4pt (p*|p = 0) = 


1.939 

N 


Equation (49) is of course exactly analogous to the expression given in (31) 
for the case of known pieans and variances. Therefore if the variance minimiza¬ 
tion problem is solved in general for the case of paragraph 6A, the large sample 
solution of the problem for unknown means and variances will also be solved. 

Figure 3 may be used to obtain the estimates p or p* in case the methods of 
paragraphs 5A or 5B arc used. Essentially the figure solves equations (27) 
and (48). The procedure for the problem of paragraph 5A is 

i) when Wi + ns > nj -4- ni evaluate the ratio —;——~— = Xq and 

jii -b ni + ns + n* 
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find, the intersection of the line x = xa with the curve for the particular X being 
used; 

ii) through the point of intersection of the vertical line Xa = x and the X 
curve draw a horizontal line; 

iii) the value of p is indicated on the vertical axis at the point of intersection 
of the horizontal line and the vertical axis, 



Fio. 3 Curves for Estimating the Correlation Coefficient p 

Tlj -j- m 

iv) when rii + na < rij 4- Ut use the ratio xo = -;-;-;-and follow 

^ ni + + na + «.4 

the same procedure, p will be the negative of the number appearing on the 
vertical axis. 

Example. Suppose a sample of 1000 is drawn from a normal bivariate popula¬ 
tion for which the mean of x is a, and the mean of y is b, and the variance of 
X is Vi , all three parameters known (it is not necessary to know vj). The xy 
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plane is cut by the three lines a: = a zfc , y = a, where, say, k = .612, so 
that X = .27. Suppose we find the observations are distributed as follows: 

in the upi)er right-hand comer: 160 — ni 

in the lower left-hand corner: 170 = na 

in the upper left-hand corner: 110 = tij 

in the lower right-hand corner: 110 = . 

To estimate p we set up Xo = (ni + n»)/{‘nt 4- ns + na + n*) = 330/550 = .6. 
Referring to Figure 3 we find that the estimate of p, p = .20. 

In using Figure 3 for this case it is useful to know that for 


X = .50 

k = 0.000 

X = .27 

k = 0.612 

X = .40 

k = 0.253 

X = .20 

k = 0.841 

X = .30 

k = 0.524 

X = .10 

k = 1.282 


If the means and variances of the variables are unknown, we may use the 
method of paragraph 5B: 

i) when n — c > c evaluate the ratio (n — c)/n — xo, and find the inter¬ 
section of the line x = xa with the curve for the particular Xi being used; 

ii) through the point of intersection of the vertical line Xo = x and the Xi 
curve draw a horizontal line; 

iii) the value of p* is indicated on the vertical aids at the point of intersection 
of the horizontal line and the vertical axis; 

iv) when n — c < c, use the ratio c/n = xo and foUow the same procedure, 
p* will be the negative of the number appearing on the vertical axis. 

Example: Suppose a sample of 1000 is drawn from a normal bivariate popu¬ 
lation with all parameters unknown. Suppose we set n = 200, and follow 
the procedure given in paragraph 5B of this section, and suppose we find the 
observations are distributed as follows: 

in the upper right-hand corner: 50 = n — c 

then of course 

m the lower left-hand corner: 50 = n — c 
in the upper left-hand corner: 150 = c 
in the lower right-hand comer; 150 = c 

The estimate this time is clearly negative, so we set xo = c/n = 150/200 = .75. 
Referring to Figure 3 we find using the curve corresponding to X = .20 that 
the estimate of p, p = — .44. 

6C. The use of averages for estimating p when the variance ratio is known. 
Nair and Shrivastava [12,1942] have considered the use of means for estimating 
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regression coefficients wlien one observation is taken at each of n equally spaced 
fixed variates, a:, (f = 1, 2, ■ • ■, n), and y is normally distributed. Their pro¬ 
cedure was essentially to consider the ordered fixed variates, and to discard a 
group of observations in the interior, much as we discarded the set of observa- 
vations whose x coordinates were 2 :„ 4 i, ■ ■ , a:jvr_„ in paragraph 5B. The 

resulting estimates depended essentially on the averages of the y’s on the right 
and left sets of observations, and on the averages of the fixed a:’s in the two 
sets. 

In an unpublished manuscript George Brown has considered a problem even 
more closely related to the one considered in paragraph 5A Suppose x and y 
normally distributed according to (20) with equal variances and means 
equal to zero, (The ratio of variances must be known, equality is unnecessaty.) 
Retain only those observations for which j a:, | > fctr, and from them form the 
statistic 


(50) 


Pb 


y+ - y- 

X+ — X-' 


where y+ and x+ are the average of the n.i I’s and y‘B for which x, > kir and 
y- and x- are similarly defined foi the observations for which x, < —ka. 
Then pa is an unbiased estimate of p. Regarding the x’s as fixed variates it 
turns out that 


( 61 ) 


\pb) = 


(1 - (I 1 \ 

{x+ — \ni Tw) 


If we approximate by substituting expected values for observed values (55) 
turns out to be (1 — p“)<r*X/2iV[p(fc)f, where X = f g{x) dx, g{x) = N{x, 

CO 

0,1). The value of k which minimizes this expression is our old friend fc = .6121, 
which gives X = .2702 Therefore for p = 0 and large samples, the minimum 
variance is approximately 1.23 a^/N, for an efficiency of about .81. The relative 
efiiciency of the methods of paragraphs 5A and 5B are .635 compared with the 
present technique. 

We presume that the analogous order statistics construction would produce 
much the same result. Our mterest in the present technique is to supply an 
approximate answer to the question of what is to be gained by going from the 
counting technique proposed in paragraph 5B to the next level of computa¬ 
tional difficulty—addition. 
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THE NON-CENTRAL WISHART DISTRIBUTION AND CERTAIN 
PROBLEMS OF MULTIVARIATE STATISTICS^ 

By T. W. Awdebbon 

Cowles Commission for Research in Economics 

1. Summary. The non-central Wishart distribution is the joint distribution 
of the sums of squares and cross-products of the deviations from the sample 
means when the observations arise from a set of normal multivariate populations 
with constant covariance matrix but expected values that vary from observation 
to observation. The characteristic function for this distribution is obtained 
from the distribution of the observations (Theorem 1). By using the char¬ 
acteristic functions it is shown that the convolution of several non-central 
Wishart distributions is another non-central Wishart distribution (Theorem 2) 
A simple integral representation of the distribution m the general case is given 
(Theorem 3). The integrand is a function of the roots of a deteiminantal equa¬ 
tion involving the matrix of sums of squares and cross-products of deviations 
of observations and the matrix of sums of squares and cross-products of devia¬ 
tions of corresponding expected values. 

The knowledge of the non-central Wishart distribution is applied to two gen¬ 
eral problems of multivariate normal statistics. The moments of the gen¬ 
eralized variance, which IS the determinant of sums of squares and cross-products 
multiplied by a constant, are given for the cases of the expected values of the 
variates lying on a line (Theorem 4) and lying on a plane (Theorem 6) The 
likelihood ratio criterion for testing linear hypotheses can be expressed as the 
ratio of two determinants or as a symmetric function of the roots of a deter- 
mmantal equation In either case there is involved a matrix having a Wishart 
distribution and another matrix independently distributed such that the sum 
of these two matrices has a non-central Wishai t distribution When the null 
hypothesis is not true the moments of this criterion are given in the non-central 
planar case (Theorem 6). 

2. Introduction. The well-known Wishart distribution is the distribution of 
the sums of squares and cross-products of deviations from the sample 
means of observations from a multivariate normal distribution. If the 
expected values of the variates change from observation to observation 
(with the covariance matrix constant), the distribution of sums of squares and 
cross-products is the non-central Wishart distribution. This distribution has 
been given explicitly [1] for the simple cases of the non-central problem. If we 
think of the expected values of each observation as defining a point in a space of 
dimensionality equal to the number of variates, we can say that the cases 
handled are those in which the points corresponding to a sample lie on a line or 


1 Part of a thesis submitted to the Mathematics Department of Princeton Umversity in 
paitial fulfillment of the requirements lor the degree of Doctor of Philosophy, June, 1945 
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a plane. Although the explicit formulas for the distribution of higher rank are 
extremely complicated and have not been derived, the characteristic function 
is relatively simple. The distribution in general can be given in terms of a 
simple multiple integral 

The Wishart distribution is the basis of much of the sampling theory associ¬ 
ated with the multivariate normal distribution. It plays a role similar to that 
of the x^'-distribution in univariate normal theory. It can be used in deriving 
the distributions of the generalized and of the multiple correlation coefficient 
when all variates have a normal distribution; it is used in deriving the moments 
of the likelihood ratio criterion for testing the general linear hypothesis (including 
the test of the means of several populations being equal) as well as deriving the 
moments of other such criteria^). For the problems of the T* and the test of 
the linear hypothesis and many other problems, the non-central Wishart dis¬ 
tribution must be substituted for the central Wishart distribution when the 
null hypothesis is not true. That is, the non-central distribution can be the 
basis of obtaimng the power function for many tests in multivariate normal 
statistics. As an example of the application of the non-central Wishart dis¬ 
tribution to these prohlema, in this paper we obtain the moments of the gen¬ 
eralized variance and the moments of the criterion for linear hypotheses when 
the population means lie on a line or a plane. Applications to other problems 
such as testing collineiarity, comparing scales of measurement, and multiple 
regression in time series analysis will be published in a later paper [3]. Another 
problem to which this non-central theory can be applied is a method of estimat¬ 
ing the parameters of a single equation of a complete system of linear stochastic 
difference equations (developed by T. W. Anderson, M. A. Girshick and H. 
Rubin), 

In [1] it was shown that one can make linear transformations on the observa¬ 
tions which simplify the derivation of the non-central Wishart distribution in 
the linear and planar cases. Consider a set of JV multivariate normal popula¬ 
tions, each of p variates. Let the z-th (i = 1, 2, •••,?) variate of the a-th 
(a = 1, 2, • ■ ■, A) population be ; let the mean of the variate be 

(1) Eix,a) = n,a (t = 1, 2, • ••,p;q: = 1, 2, • W). 

Let the covariance matrix (of rank p) common to all N populations be 

II E{x,a Ml‘a)(aija Mia) || — |1 <^<1 1| 

{a = 1, 2, •••, N). 

The probability element of the x.a can be written as 

(2) 1 cr*' 1 ‘^(2x)-‘'’^ exp [- 1 i: (x,, - M.,)(x,„ - p,„)]n dx,,. 

1,1,a I,a 

where 



See Wilks [2] for example. 
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The sum of squares and cross-products of deviations from the means in a sample 
{a:<a) are 


(3) 

where 


a-I 


1 ” 

= TyS a:.. 

iV a-1 


The dimensionality, say t, of the space spanned by || |1 is equal to the rank of 

(4) II 7-., II = II 2 (w« — — Ji,) II , 


where 


1 

= V-rZ 




As a result of a linear transformation it was demonstrated that the distribution 

AT—1 

of a,} is the same as that of x',a x,a where the x,a have a normal multi- 

a»"l 

variate distribution with covariance matrix |1 er,;|| and expected values 

“ M»o( 

(t = 1, 2, • • •, p; « = 1, 2, • • •, IV — 1), 


such that 


Tij II = llZ 


The joint distribution of o.y is given for three cases: 

(i) Case t = Q: 

(6) Wia„ , cr., ,u,;P,N- 1, 0) = iTo 1 a' | | o., | exp [- ^E <r" o.J; 

(ii) Case t = 1: 

(6) W(a,,, (T., ,r„;p,N- 1,1) = iCi exp [-^ E 1 1 <r''' |*‘”“'’ I a., 


X exp [- ^E<r”owHEa.,T.-,]~‘'^'^’li(^^)(V'Ea.,r.-,); 

(iii) Case t = 2: 

W{a.,, <r., ,t.,:P,N - 1, 2 ) = iCj exp [- § E 1 


(7) 


X I o., I ^ exp [ i E v”oi,] E 2®“ io!r(^[lV - 2] -H w) 



412 


T. W. ANDERSON 


where 


^ n - i]), 


P—1 




P-.2 


KT^ = r(M^^ - 2 - »]), 


I„{x) is the Bessel function of purely imaginary argument, and ui and Ui are 
the two non-zero roots of 

(8) It- XA~' ( == 0 

(here T = || || and A = 1| flu 1|) The number iV — i is the number of 

degrees of freedom and t is the rank, The matrix || o-j, || we shall call the sigma 
matrix, and || t„ |j we shall call the means sigma matrix. 

Let Ki, Kj, - ■, Kp be the real, non-negative roots of the determmantal equation 

(9) 1 T - XS ( = 0 

(where 2 = II o’.; || )• There is a non-singular p X p matrix ^ (= || ^,, ||) 
such that 

(10) \['24'' = I 
and 

( 11 ) = 11 11 

(where I is the identity, is the transpose of SE' and 5u — 1 for i ~ j and 0 for 
i 9 ^ j). Then the quantities 

P 

(12) hi, = ^*) 4^%h ^jfc flftfc 

hik^l 

have the distribution W(btj, 5,,, ; p, n, t) where n = N — 1 and k? = 0 

for i = t + 1, t + 2, ■ ■ p). This IS the same distribution that would be 

derived if the b,,' were defined by 

n 

( 13 ) bij Vitx Vja 7 

a_l 


where the distribution of the yu is 

(14) (2,r)-^’’" exp [- hjl't (l/.= “ <c. S.-)!* • 

«-l a-.l 

This simplified distribution of the observations has been called the canonical 
form 
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3. The characteristic function of the non-central Wishart distribution. We 

shall find the characteristic function of the a., and 2a, ^ ^ j) as defined in (3). 

We first obtain the characteristic function of the b,, and 2b,, {i ^ j) as defined 
in (13) and then perform a linear transformation to obtain the characteristic 
function of the a,,. The characteristic function of the b„ and 2h,j (i ^ j) 
is defined as 

(15) fi^exp . 

where 


0,j — 0ji 

and t in the exponent is the imaginary quantity 
We can write (15) as 

£;^exp 1^4 2 y.a y,a 0„ 

= (27r)"*’'" f ■■■I expF— + i X 

J_ao J_ ao L «=1 t.J—1 J 

X n n • 


Let us first integrate the y,a for i = 1, 2, • • , p and a = t 1, t + 2, • • n, 
that is, make the integration 

• • I 

«0 v—oo 

■expF—z) f z z n n 

L ^ *“1 «=c+l a=“l J »—1 

This is, however, the characteristic function of a Wishart distribution with 
n — 1 degrees of freedom [4], namely 

( 16 ) I S., - 2i9„ . 

Now we must make the integration 


(17) 


(2,)->» exp [4 g. 


^ ^ ^ ~| J t 

rt y*ij ^ y^iiVin^xi "i" I ^ HH ^2/*^* 

^ tiBl |]al 11 — 1 _l »-“l 1=1 

There is a p X p matrix G ~ 11 6^^? 11 such that 


^ J yhx^khQk-i J 
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where 


dii, = Skh ~ SiOtA. 


Let us make the transformation 


2/ii| 0‘h^ki) d K) , 

k-1 

where 

iid‘Ni = iM.Air. 


Then the exponent of (17) within the integral sign is 



and the Jacobian of the transformation is 
Hence, the integral of (17) is 

(18) 1 d« 1“ exp [-i (^i: 4 - t . 

This result is obviously true if the 6jy are pure imaginary and sufficiently small 
so that II dtj II = II - 2idi, || (which is real in this case) is positive definite. 
For all complex di^ in a neighborhood of the origin (17) converges because the 
real part of [j 1| is positive definite. Similarly the integral of the derivative 
with respect to 6tj of the integrand converges for in this neighborhood. It 
follows that the (complex) derivative of the characteristic function exists in this 
neighborhood because the derivative of the integrand is measurable and is abso¬ 
lutely integrable. Therefore, the characteristic function is analytic in a neigh¬ 
borhood of the origin. From this it follows that the characteristic function is 
analytic in an open set containing the flat space of real 6i,. By analytic con¬ 
tinuation, then, (18) is the value of (17) in the open set containmg real 0,-,. The 
characteristic function (15) is the product of (16) and (18). Accordingly, we 
have the result that the characteristic function of the 6„and 2bij defined 

by (13) is 

(19) I d" p” exp [-i (i: ki - i: d^vj)]. 

It is clear that if x, = 0 (for all tj), this function reduces to the characteristic 
function of the Wishart distribution with n degrees of freedom, namely, 

(20) 1 - 2i0,^ r‘". 

It is interesting to note that (19) factors into two parts, one of which is (20) 
and the other is 


( 21 ) 
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The distribution function similarly factors mto two parts, one of which is the 
Wishart distribution, whose characteristic function is (20). Thus the non¬ 
central Wishart distribution function is the convolution of a function (central 
Wishart distribution) and another (the transform of (21) the first of which is 
a factor of this same non-central Wishart distribution. 

In the planar case the characteristic function can be written as 

exp [-^(«; - /\^)] _ exp [-U4 - 

I - 2ie„ ‘ I 5., - 2 ie„ ’ 

where ni -\r Ui = n. From this fact it is clear that the distribution for the 
planar case (if n > 2p + 2) is a convolution of two distributions each of the 
linear case. 

This deduction can also be made from the distribution (14). Let 

ni+ l 

a m3 
n 

+ £ 2/iaJ/ja. 

Then it is clear that the K, has the non-central Wishart distribution with ni 
degrees of freedom and parameter k? in the direction of the first coordinate 
axis, while the fe„- has the non-central Wishart distribution with n* degrees of 
freedom and parameter kI in the direction of the second coordinate axis. Since 

&t, = K, -f , 

the distribution of the b., is a convolution of the distributions of h',j and b",. 
In general the non-central distribution is the convolution of t distributions of 
the linear case (provided n > + f). 

It is easy to show that if one has two (or more) non-central Wishart dis¬ 
tributions of rank 1 with parameters m the same direction, the convolution is 
again a non-central Wishart distribution with parameter in the same direction. 
Suppose b[j and b” have non-central Wishart distributions with parameter k'i 
and k"1 in the direction of the first coordinate axes and ni and th degrees of 
freedom respectively. The characteristic functions are 

exp 

and 

1 l^-^exp [-Kki’ - 

The product is 

1 d*' l‘"exp - cJ“k?)], 

where n = m + ns and ki = -f k"\. 

Now let us deduce the characteristic function of the o,i and 20 ,-^ {i 9 ^ j). 

Since by (12) the b’s are transforms of the a’s we can write a,,= 2 bhk . 
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XIICIJ 

(22) E (exp = -E ^exp ^ , 

where 4>{j - <^y,. If we define 

(23) = i, 

then (22) can be derived by substituting (23) in (19). 

Let 

$ = Nul|. 

Then 

II di J| = Z) = ^'-'(2"^ - 2i<l>)4'“^ 

and 

D-i = - 2i4>)“^I''. 

The characteristic function of the a’s then can be written as 

exp [-^{<r('J'T4^') - ir[4'(S~' - 2z4.)“^4''4'r^']}] 


. jS-i - 2i$| . 

using (10) and (11) The denominator is 

(l^'l I^ I r*" I 2~‘ - 2i4> I*” 
and the numerator can be written as® 
exp [—^(tr (Af'4''4filf) — tr [M'’i''4'(S~' — 2z$)“^'4'M]) ] 
where 

M = II /Ua - Hi II 

and 

M'M = T. 


We may summarize in the following theorem: 

Theorem 1. G^ven o,y (i, j = 1,2, p) defined by (3) where the x,a {i = 
1, 2, • •, p, 0 ! = 1, 2, ■ • - , W) are distributed according to (2), the characteristic 
function of a,j and 2a, j {i ^ j) is 



’ The result follows from the fact that tr(AB) = tr(BA). 
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where 

and 

<l>t3 — 4>ji - 

Suppose we have two sets of quantities a',, and a", each set of which is dis¬ 
tributed according to a non-central Wishart distribution with sigma matrix 
II II, one having n' degrees of freedom, (or n"), means sigma matrix r\j (or 
T,,') of rank i! (or t"). Consideration of the characteristic functions (24) shows 
that 

/ I » 

CLij — dt) ”r dij 

has a non-central Wishart distribution with matrix || tr” |1, n' -f- n" degrees of 
freedom and a matrix 

lln-.ll = II r:, 11 -f II 4 ||. 

The rank of the distribution is equal to the rank of || r,, |1. This result can 
also be deduced from the representation of a'„ and a", in terms of observations 
from non-central normal populations. It is a straightforward generalization 
of the same result for central Wishart distributions. 

Theorem 2. The convolution of two or more non-central Wishart distributions 
with identical sigma matrices is a non-central Wishart distribution with means 
sigma matrix equal to the sum of the means sigma matrices of the components, 

4. An integral representation of the non-central Wishart distribution in the 
general case. It was shown in [1] that 

W{h, , , kU., ; P, n, t) = J \B - YY' dY 

where 

dB = n n - 

dY = nndy.,, 

i“i 

B = II 6.,-II, 

Y = ||y.J| 

K = 1|k.6.J| 

and the integration is on Y over the range 11 B 
This is equivalent to 


in = 1, 2, ■■■,£), 

in = 1,2, 

— YY' 11 positive semi-definite. 
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(25) I B I \ I - Y'Er^Y dY. 

The integration is over the range of Y for which || f || is positive 

semi-definite. 

There is a p by p matrix H = 11 Aiy | j such that 

= I 

H'K = W =^\\ wA, II, 

where are the roots of 

I K^i - \h'^ \ = 0, 

II 11 = II 

Then make the transformation to 2 = || z,, || by 

Y = HZ. 

The Jacobian of the transformation is 

I // 1‘ = I £ |‘‘. 

Then (26) can be written as 

(26) Ce-*''" I B lb"-*-!) I \ I - Z'Z p'"-*’"'-” dZ, 

Partition 

z = 

z, 

such that Z\ is square ({ X 0- Let I — Z[Zi = E'E, (in terms of Zi), where 
E is specified uniquely and consider the transformation of variables from Z 2 
to V defined by 

Zi = VE. 

Then (26) can be written as 

C-g-ifra I £ |Kr.-P-U J \ J - Z[Zi g'rtB-fZx) 

' I \i ~ F'y|‘'"-^“‘“'^dyi 

where 

Wi - l|u),i,tll (n, t = 1, 2, 0- 

The first integration is over the range (7 — ZiZi) positive semi-definite and the 
Second is over (7 — V'V) positive semi-definite. The value of the second 
integral is 
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/ 


J -y/y |lCn-P-<-l) ^y 


n r(§[n - 2< + 1 - il) 

i-i 


n r(Mn - / + 1 - tl) 

^-1 


Hence (26) can be written as 
(27) J 17 




with 

CT^ 

(28) 


1«>-(K'J0 nipn }p(li-l)+li® 
e ^ IT 

X n r(Hn - i + 1 - i]) n r(i[« - i + 1 - < 1 ). 

i—1 «—1 


The first part of (27) is, except for a constant factor, a central Wishart distri¬ 
bution with n degrees of freedom. The integral of the second part is obviously 
a symmetric function of the lo,. In terms of the o,‘y the to, are simply the 
roots of (8). We can sum these results in a theorem. 

Theorem 3. Given a sample of observations |a:,i,) {i — 1, 2, • ■ p; a = 1, 2, 
N) distributed according to (2), the probability density function of the sums 
of squares and cross products of deviations from the sample means defined by (3) is 


W I |J(W-J>-2> 


W(a.f, otj , T,j p, N — 1, t) — Ci\a 

•exp J j5,s “ 2 




( ( 

■ exp ^ to, 2 ,i H d2,{ 

t-i ii{—t 


integrated over 

t 

8i,( Zn Sit 

1—1 

positive semi-definite where Ci {n = N — 1) and ti, are defined by (28) and (4), 
respectively, and where w\ are the t non-zero roots of (8). 

6. The moments of the generalized variance in the linear and planar non- 
ccnixnl cRSftSa 

5.1. The linear case. The generalized variance, which is the determinant of 
the variances and covariances,* is a measure of the spread of the observations. 
If one t.biTilrR of the N observations of each variate as a vector in iV-space with 


* This definition of Wilks [5] was made in terms of variances and covariances defined by 
ao/N(from equation (3)) Since we consider a„/(N-l) to be the variances and covariances 
we define | a,y/(N-l) | as the generalized variance. 
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origin at the sample mean, fhe generalized variance is proportional to the square 
of the volume of the p dimensional parallelotope which is defined by these 
vectors as principal edges. Another geometric interpretation can be given in 
terras of the p-dimensional variate space. The generalized variance is pro¬ 
portional to the sum of the squared volumes of all possible parallelotopes that 
can be joined by choosing as the p principal edges p of the N sample vectors 
(origm at the sample mean). 

In this section we consider the moments of the generalized variance when the 
distributions of the observations are non-central multivariate normal. In 
terms of the first geometric representation this means that the center of one or 
more of the vector distributions is different from the others. For convenience 
we shall assume that the distribution of the observations {y.a} is according to 
(14). This will give as much generality as if we treated observations (x,a) 
having the distribution (2). Moreover, we shall consider the determinant of 
sums of squares and cross-products instead of the determinant of variances and 
covariances. It is clear that the determinantant | hvj |, defined by (13), is 

simply a multiple (by | 2 | (IV — 1)”) of , defined by (3). 

Let us first consider the linear case, i.e., k = 0 and = 0 (z = 2, • • •, p) 

in (14). The first of the p vectors is centered on the first coordinate axis, not 
at the origin. Then the probability density function of the h, is 


I t |j(n-p-l) 


L_J Y 


2*^V”''-!’ n r(H» - *J) 


a-o 2»"a!r(^n + a) ‘ 


We wish to find the moments E{ \ b/y |‘). Let 


b(y = fliSyr.y. 

Then s’, is the sum of squares of the t-th variate and 11 r<y 11 is the matrix of 
sample correlation coefficients. The Jacobian of this transformation (to sj, 
n,) is 

The probability element of the s^’b and r’s is 


( 30 ) 


exp \-^K sjl ]ft (fl?)*"-’ 

2bn^ip(,-l) g 

<-l 


Iny 


V y ^i) 

ilo 2>“a!r(^n + a) 


j, j, p 

nd(s?)n n 

•-1 »-i 1—i+i 
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It is clear from (30) that the si are distributed independently and that the set 
r ,3 have a joint distribution independent of the sj’s. Hence 


E(| K h = £(| 1‘) = n U, T). 


The probability element of s? (i = 2, 3, • • •, p) is 

which IS simply the x*-distribution. The h-th moment of s? (i = 2, 3, 

H[(s!)*] = Qn + h) _ 
r(§n) 


, p) is 


The probability element of si is 

(31) 


2.2\a 


(kX) 


1 ). 


21" “o 2*“a!r(§Tl + a) 

This is the x'^-distribution (non-central x'-distribution) which was given by 
Fisher [6]. Applying term-by-term integration (the series converges properly) 
we get the fi-th moment 

ElM ] - 2-e -- 2^jr - ( T n - + - a) " ’ 

The probability element of the r,j is the well known distribution of correlation 
coeflScients, 

I r,, A A , 

^ 11 II dr„ . 

2^ p(if„ _ ^-j) .-I ,-.+i 

1-1 


Since 


/Ij^ 

J TT 


P— 

i^Cn-p—1) p p H fl) 

*— n n dn, = . 

,_i r” ^{jn) 


ip(p—l) 


where the integration is over the entire (permissable) range o the rjj, we have 
as a consequence the /i-th moment of the determinant (since n is arbitrary) 


1*^ - r"-41n) . P 

•E(l ^^3 I ) “ J»—I I 4 p(p-i) il 11 

n r(l[ 7 i - f]) 


n r(M« - + A)r'’~^(ln) 

1—1 

n mn - iW-^in + h) 

1-1 
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Hence, the h-th. moment of | s,s/r,y | is 


(32) 


n + 2h- 0 ) 

2 >’* 1-1 _e"^** 

fi min - » 1 ) 

iwl 


^ 4* A "H «) 

ap *0 2 "air(fn + «) 


Let ns summarize this in a theorem for the an. 

Theorem 4. If the quantities a,y (i, j = 1, 2, ■•',?) have the distribution 
Wiflij , (r,y, rij; P) — li 1) defined by (6), then the mom&nts of\aij\ are given by 


S(l «./!*) 




i-i _ 

fi Tim -1 - fi) 


K^rim -i] + h +a) 
2 “a!r(MfV - 1 ] + a) ’ 


where k is the non-zero root of (9). 

The )i-th moment of the generalized variance 1 an/(N — 1 ) | is obtained by 
dividing the above expression by (N — I)***. 

If K = 0, expression (32) clearly reduces to the moment given by Wilks [ 5 ] 

fi r(i[n + 1 - + h) 

(33) 2"* -. 

fi r(i[n + 1 - fl) 


The expression (32) gives the moments of the generalized variance when the 
means of the observations are not fixed, but lie on a line. The distribution of 
I hjj 1 is not a simple function even in the central case. However, in any par¬ 
ticular case one could find the first few moments of ] h(y j and fit a distribution 
function. It is to be noted that the convergence of the series is nearly as rapid 
as that for 

5.2. The planar case. Next we shall treat the planar case for two dimensions. 
Suppose that xy 0 (z = 1 , 2 ). The probability density function of bn, bn, 

SiUci 622 35 


exp -1(4 -f Kj) - i 23 b„ 

L »-i 

(34) 2"vV 

^ 'V' ” bi*)]”(Ki bn kJ bii)^ 

^ ajt, 2‘‘>+«»a!/3ir(^[n - 1] + a)r(i7i + 2a + p)‘ 

Let 611 = Si, bjs = si, and bw = siSar. The Jacobian is SiSa. The probability 
element of s?, Sj and r is 


1 

^ ( 6 «ba - 


'rv 7 


(si)*"-'(4)‘’-'(l - 

^ V («^2 4s^)"( 1 - 4)°(44 + 48l)^d(s;) djaX) dr 

atio 2‘“+»/’«I^!r(Mn - 1] 4- a)r(in + 2a -b /S) ‘ 


( 35 ) 
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We wish to find S{[siS 2 (l — Let us first multiply (35) by (1 — r)’' 

and integrate from — 1 to + 1 . We then obtain 

w ^ (ki K2 Si S2)‘‘(/Ci Si + 1C2 Sa)— 1 ] + ft + a) djsl) d{sl) 
2‘“+^al/3!r(^[ji - 11 + a)r(§7i + 2a + /8)r(^7i + ;i + a) ‘ 

Next we multiply by (s?)'^(s 3 )*, set (k?s? + * 2 S 2 )'’/i 8 ! equal to 

and integrate si and sa from 0 to 00 . We obtain 

^([ 6,1622 - vl,f) 

(36) =2“exp[-K'^i + Ki)] 

Y r(|n + + g + ffi)r(in + A + a + ft) 

2*“+'’^+^* a!/3i!^2ir(i[n - 1] + a)r(^n + 2o + ;9i + ft) 

r(Kn — l] + h^ a) 

r(ltn + li + a) ’ 

which is the expected value we are seeking. 

Clearly this reduces to a special case of (32) if xi is set equal to zero. 

Now we consider the planar case in p dimensions. Geometrically we have 
p vectors in n-space. If the {y,a} are distributed according to (14) the mean 
point (i.e., center of distribution) of the first two vectors is different from the 
origin, but the mean point of each of the other p — 2 vectors is the origin. 
The vectors are distributed independently. The determinant 

n 

1 1 ~ £ J/ial/jor 

a—1 

is the square of the volume of the parallelepiped which can be expressed as 
V 1 V 2 ■ ■ • v, sin ft sin ft • • • sin 9p-i, 

where v, is the length of the t-th vector and d, is the angle between the (i + l)-st 
vector and the flat space determined by the first % vectors. The distribution of 
tij, • • •, Vp and ft , ■ • •, 0p^i is statistically independent of Vi, V 2 , and ft ; for 
no matter what the plane of the first two vectors is, the conditional distribution 
of the other variables is the same. Hence 

■E( 1 1 *) = E[{viV 2 sin ft)^*] • E[(t) 3 V 4 • • • v, sin ft • • • sin flp-i)**]. 

If the y’s had simply the distribution 
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(37) 

then the h-th. moment of 


1 


exp 


'-i±± »!.], 


( 27 r) 

11!)(; I would be (33), and the h-th moment of 


wl Uj sin’ Bi 


hii ia 
hu hjj 


would be 

2 Ssnr(HM + 1 - i) + h) 

n r(Hn + 1 - i]) 

Since the distribution olvt,Vi, - • •, Hj, and 9t, • ■ -, is the same whether the 

y’a are distributed according to (14) or (37), we have 

llr(Mn + 2 h+l-il) 

(38) E[(t-a... a, sin • sin -g-. 

n r(J[n -f 1 - i]) 

Multiplying (36) by (38) we obtain the ^-th moment of | &<y |, namely, 

llmtn + 2 h + 1 - *]) 

E( I 6,, r) = 2*” exp [- ^(«? -f kI)] g- 

nr(jb +1 - f]) 

1-8 

1 ! r(i[n - 1 ] -h a) 


r(^ 4 - h + “ + ;^») r(^[n — 1 ]+ h + a) 

f’(iw -f- 2a + di + /Sail'd^ -f- h + a) 

This result may be summai-ized as follows: 

Theobem 5. Let the probdbihty density function of the quantities a<y (i, j = 
1 , 2, • ■ p) be 


W{a{j, an tUi\PyN — 1 , 2 ) 

defined by (7). Then the h-th moment 0 / j a,y | is 


Ed^oT) = |a‘'l'‘2*'’exp[-i.i-H4 


-i] + h) 
ftr(i[h7-ij) 
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( 39 ) V -l]+h + a + ft)r(i[iV - 1] + + « + |9a^ 

2*“+'’^+'’*«!/3i!/ 32! - 2] + a)r(HAr - 1] + 2a + /3i + /3j) 

rm -2] + h + a) 

T{m -i] + h+ay 

with kI defined by (9). 

The h-th moment of the generalized variance | a,j/{N — 1) | is obtained by 
dividing the above expression by (N — 1)"'^. This formula holds for all h > 

-m - ?)■ 


6. The moments of the criterion for testing linear hypothesis in the linear and 
planar non-central cases. 

6.1. The moments of the criterion. There are several linear hypotheses con¬ 
cerning the means of multivariate normal populations that can be included in 
a general formulation of the problem. We shall first of all consider a simple 
case of a linear hypothesis and find the moments of the criterion under linear and 
planar alternatives. In Section 6.2 we shah indicate some linear hypotheses 
that can be reduced to this simple case. Regression problems and the problem 
of equality of means m several populations (studied by Wilks) are included. 

Suppose the variates (i = 1, 2, • - p; a = 1,2, • • •, n) and y^y (i = 1, 2, 
... p; y = 1, 2, • • •, g) have the probability element 

I *J |}(n+9) r r " 

(40) 

exp F - i 23 23 o'’(y,y — li,y)(y,y — Pjt)"]!!!! dZiiTlfldyiy 
Let us consider the hypothesis Ho that the means of mp y’s are zero, namely, 


Let 

(41) 

(42) 


Ha: n,y = 0 (f = 1,2, • • •, p;7 = 1, 2, ■ ■ •, m) 


Ofij 


2 V'y “it > 

T-l 


n 







(43) c,-,- — o„- + b {,. 

Then the likelihood ratio criterion for testing Ho, called by Hsu [8] the Wilks- 
Lawley hypothesis, is the i(ri + q) power of 


W = 


\K 


cy 


(44) 
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Under the null hypothesis the have a Wishart distribution with n degrees 
of freedom, and the a,j are distributed independently of bj, such that c,y has a 
Wishart distribution with n + m degrees of freedom. Wilks [ 6 ] has given the 
moments of W and in some special cases the distribution of W. 

We shall now obtain the moments of W for distributions specified by (40) 
where the rank of || fUy || (7 = 1 , 2 , • * m) is 2 , i.e., the planar case. Under 
this assumption the hj have a Wishart distribution with n degrees of freedom, 
the Oi/ are independently distributed in such a way that the Cij have a non¬ 
central Wishart distribution with n + m degrees of freedom. Let kI and kI 
be the non-zero roots of 


(45) 


S) M/r 


T-l 


0 . 


It is clear that the distribution of W is unchanged if is set equal to , Fur¬ 
thermore, we can take uj = k%j then the c,, are distributed according to 
Wictj, Sij, n’jStj', p,n, 2) with n -f- m degrees of freedom. The moments 
wiU be obtained by a method similar to that used by Wilks [ 6 ]. 

Let the expected value given by (39) be 


(46) ^^dc.jT) = K(n + m, h, p, x?), 


which is a constant depending on n + m, h, p, kI , and xj. If D(a,y) represents 
the distribution function of the a.v, one can write (46) as 

(47) K{n -h m, h, p, x?) = -- 

2*'’"T*'’''-«nr(i[n-fl“i]) 


/ I Cij r 1 K- 1*'"-'’-” exp [- i 2: b.,] Diaid] ft ft db^ U 


where dA is the volume element of the o,y, and where the integration is over 
the entire (pennissable) ranges of the hj and at ,. Equation (47) holds since 
the c’s are functions of the h’s and n’s. Multiplying (47) by 

(48) 2‘'’'‘ftr(Kn+l-»]), 

iml 


then replacing nhy g + 2 and dividing by (48) again, we obtain 

2‘'^"+’'’ftr(J[n + 1 - t] + i7) 

(49) Kin + m + 2 ( 7 , h, p, x?) -- 

2‘'”‘ftr(i[n + l -fl) 

_ 1 _ 

1 -zl) 

• / I c./ r I bii exp [- W I)(a.,)ft ftd 6 ./dA]l. 
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By definition the right hand side of (49) is the expected value of | c,j ^ | bij |'. 
Hence 

z*" n r(^[n + l- i] + g) 

■^'(l c*i I I 1®) = K(n + m + 2g, h, p, k<)-- 

n r(Mn +l-t\) 

1-1 

In this expression it is permissible to set h equal to —g (n could have been 
replaced by n + 2g in (47) to insure the argument of each T function being 
positive). Then we have 

E(W°) = H(| c r* I h, |“) 

2 ''f[r(i[n + l -t] + i7) 

= K{n + m + 2g, — g, p, k*)--. 

n r(i[n + 1 - il) 

Finally, the g-th moment is 

{[ r(i[n + ffi +1 -1]) n r(i[n +1 - 1 ] 4- ff) 

E(W°) = exp [-UkI + - 

n r(Mn + TO + l-i]4*ff)II r(i[n + 1 — i]) 

i-» i-l 

y r (<c!)‘'+^‘(<c^)°'^^»r(Mn + m] + « + p,)mn + m] + a + p,) 

' + m- l] + g + a)r(Mu + m] + a) 

r(i[n + m - 1] + «) "I 

’ r(§[n + m] + g + 2u +pi + Pt)j • 

We can summarize in the following theorem: 

Theorem 6 Let Zia {i = 1, 2, p: a = 1, 2, • • ■, n) and yiy (i = 1, 2, 

■ ■ ■> P; 7 = li 2, •• •, q) have (40) as a joint distribution. Define a,,, b{, and Cij 
by (41), (42), and (43), respectively. Let kI and kI be the non-zero roots of (45). 
Then the g-th moment of W, defined by (44), is (50). Expression (50) gives the 
moments of W in the planar case. The linear case is a special case of the planar 
case, that is, it is the planar case for kI = 0. The g-th moment of TT in the 
linear case is given by 

nr(M^ + m + 1 - i]) n T(Kw + 1 - t] + c) 

E{W°) = exp [-141 - 

(51) n r(M?i + m + l-i] + £7)II r(|[n + 1 - i]) 

1—2 1—1 

y y (ki)*^^ r(^[« + m] + pi) 

^ 2^^ft!r(Mn + m]+g + Pi)‘ 

For kI = 0, (51) reduces to the expresion given for the moments under the 
null hypothesis. 
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Wilks [7] has given the distribution of W under the null hypothesis for 
several special cases (i.e., certain pairs of n and p). In general, however, the 
distribution function is too complicated to write down explicitly. When the 
null hypothesis is not satisfied (i.e., at least one kI ^ 0) the distribution functions 
are yet more involved. Hence, we shall not write any explicitly. 

Hsu [8] has given the asymptotic distribution of W. Suppose that 




- t £ 

i.j-l 7-1 




tends to the limit as n tends to infinity (if the m’s are functions of n ). Then 
the limiting distribution of a: = — (n + g) log W (which equals —2 log A, where 
A is the likelihood ratio criterion) is 


(52) 


n—ijrni -i'I'i) Ipm-l —Ji’ V' _ ^0 ^ _ 

2** a! V{\pm + a)' 


That is, it is the distribution with pm degrees of freedom and parameter . 

For most purposes, alternative hypotheses of the meanb being on a line 
(ie., of rank one) are sufficiently general. In any particular case, one can 
compute from (61) mumerical values for several moments and then fit an appro¬ 
priate distribution function. If one wishes to consider alternative hypotheses 
of rank two, one can use (50) and similarly compute numerical values for mom¬ 
ents. The series in either (51) or (50) converge rapidly. To construct an 
approximate power function for linear alternatives, say, one would fit distribu¬ 
tion functions for several values of k\ and find the desired percentage levels. 

There is a matrix H d,y jj such that 


l|5wll = l|dijl!-||d.yir 


and 


ll«wIl = l|di/l|-|lXi«oll-|ldi,|r, 

where the X’s are roots of 


(63) 1 0., - 'Khij 1 = 0 

It follows that * 

l|c*y|| = l|d.',ll-l|(l + X,)«.,||-|ld,-yir. 

Then W can be written as 

141 • 141' 

141 ■ • 141' 

1 

n (1+ K) 


( 54 ) 
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The distribution of the roots of (53) in the linear case has been given by Roy 
[9] for fli, 0 dimensionality p.® The distribution in the planar case has been 
indicated by Anderson [3]. One could obtain the probability of JV not ex¬ 
ceeding a given value by integrating the X’s over the proper range. 

6.2. Examples of the general linear hypothesis. A number of hypotheses 
concerning the expected values of variates with multivariate normal distributions 
can be put into the form of Ha The equivalence of the hypotheses is demon¬ 
strated by means of linear transformations. 

As an example consider the hypothesis Hi that the means of several normal 
multivariate populations are equal when the respective covariance matrices 
aie equal. Let be the a-th (o = 1, 2, - • A“) observation of the i-th (i = 
1 ,2, ■ • •, p) variate in the u-th (ii = 1, 2, • , U) population. Let 

(55) £(0=p: (i= 1,2, --..p) 

(u = 1 , 2 , 


and let the covariance matrix be 11 tr,, 11 Then the hypothesis is 


(56) 


Hi •.p't = Mr 


For testing this hypothesis let 


(57) 

(58) 

where 

(59) 


U 

5., = E Z - ^r), 


a., = EA“(t“ - «.)(2r - 

U_1 


1 

1 tt Ar« 

5. = E E , 

iV u»I a-1 


= E iv“. 


(i = 1, 2, p) 

(u = l,2, ••■,17). 


The h,j have n = N — U degrees of freedom and c„ = a,, -|- h,j , have N — 1 
degrees of freedom. Then the N/2 root of the likelihood ratio criterion for Hi 
is W defined by (44). For this case equation (45) is 


where 


E — fi,)(M5‘ — Ml) — hffxi 


= 0 , 


M. = 




u 

M. • 


‘ Roy erroneously claims his distribution to hold for the planar case and higher rank. 
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Hsu has demonstrated that the general regression problem can be put into 
the form of . Suppose that xta {i = 1, 2, • • •, p; a = 1, 2, ■ • IV) follow 
a multivariate normal distribution with covariance matrix I1 <r</ II, and let the 
expected value of X{a be 

E(Xiix) — i PtrWra (S < iV — p), 

r-1 

where the qhy N matrix 

W= ||m„|| 

is of rank q. Let Ht be the hypothesis that 

Hi : -Bi = 11 Ptu 11-0 (i = 1, 2, • • p; u = 1, 2, • • •, m < q) 
with the w’s known. Let 

Wi = ||w««|| (m= 1,2, 1,2, •••,iV) 

Wi = ||tu„ i| 

(r = m + 1, •••, 9; a = 1, 2, •••, iV), 

X - l|x,.l| (i = l,2, •■.,p;a« 1,2, ...,JV). 
Let 

II II = XX' - XW'iWWT^WX 
II c.v II = XX' - XWiiWiW'ir^WiX'. 

(with 11 Cj/11 = XX' if W 2 = 0). Then the likelihood ratio criterion for Hz 
is the iV/2-th power of W, defined by (44). 

The equation (46) can be written in terms of Z, B\ , and W as 

(60) \BiWi{I - Wz{W2Wzr^Wi)W[B[ - AS 1 = 0 
for m < q. litn ~ q, (45) becomes 

(61) 1 BiWW'Bi - AS I = 0. 

In (60) and (61) there are no more non-zero roots than the rank of Bi . It is 
clear that the roots of (60) (or (61)) depend on the matrix W as well as Bi. The 
distribution of A the likelihood ratio criterion under the null hypothesis does not 
depend on the distribution of the matrix W (if W is not constant). However, the 
distribution when the null hypothesis is not satisfied does depend on /c? or on ki 
and Kj, and hence, on the distribution of the elements of W as well as the value 
of Bi. 

The special case of JVo for m = g = 1 gives as the likelihood ratio criterion 
as a function of Hotelling’s generalized T*. From the moments indicated in (50) 
we can deduce the distribution of IT* when the null hypothesis is not true [3]. 
This result has been obtained by Hsu [10] by another method. 
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ON HOTELLING’S WEIGHING PROBLEM^ 

By Alexandhr M. Mood 
Iowa State College 


1. Summary. The paper contains some solutions of the weighing problems 
proposed by Hotelling [1]. The experimental designs are applicable to a broad 
class of problems of measurement of similar objects. The chemical balance 
problem (in which objects may be placed in either of the two pans of the bal¬ 
ance) is almost completely solved by means of designs constructed from Hadamard 
matrices. Designs are provided both for a balance which has a bias and for 
one which has no bias. 

The spring balance problem (in which objects may be placed in only one pan) 
is completely solved when the balance is biased. For an unbiased spring 
balance, designs are given for small numbers of objects and weighing operations. 
Also the most efficient designs are found for the unbiased spring balance, but 
it is shown that in some cases these cannot be used unless the number of weigh- 

i large as the binomial coefficientf , ^ ) or ( , 

\W \Hp + 1 ) 


mgs 18 as I 


Uli 

) 


where p is the num¬ 


ber of objects. 

It is found that when p objects are weighed in iV > p weighings, the variances 
of the estimates of the weights are of the order of //N in the chemical balance 
case {a is the variance of a single weighing), and of the order of ii//N in the 
spring balance case. 


2. Introduction. The problem is fully discussed by Hotelling [1] and refers 
to the design of a certain class of simple experiments. We may consider the 
typical example of the class to be that of weighing several small objects on a 
chemical balance or other weighing device. Hotelling and Yates [2] have shown 
that the individual weights may be determined more accurately by weighing 
the objects in combinations rather than weighing each one separately. The 
designs are applicable to a great variety of problems of measurement, not only 
of weights, but of lengths, voltages and resistances, concentrations of chemicals 
in solutions, in fact any measurements such that the measure of a combination 
is a known linear function of the separate measures with numerically equal 
coefficients. The designs should be particularly useful in biological and chemical 
laboratories engaged in routine chemical analyses, We shall, however, in the 
interest of simplicity, discuss the problem in the language of weighing operations. 

A particular design is denoted by a matrix. The three objects to be weighed 
in four weighing operations may be weighed by the following design: 

‘ Journal Paper No. J-1405 of the Iowa Agricultural Experiment Station, Ames, Iowa. 
Project No, 890. 
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1 1 0 

w _ 10 1 

0 1 1 

111 

where the rows refer to weighing operatioDjg and the columns refer to the objects. 
In the above design the first two objects are weighed together in the first weigh¬ 
ing operation, the first and third objects are weighed together in the second 
weighing operation, etc. From the four resulting weights the individual weights 
are estimated by the method of least squares The design problem consists of 
finding matrices which will mimmize the vanances of these estimates. 

There are two distinct though closely related problems here. One is to find 
efficient designs for the case in which the measure of a combination can only 
be the sum of the individual measures. This would be the case, for example, 
in weighing objects with a spring balance and we shall refer to it as the spring 
balance problem. The other problem is to find designs when an individual 
measurement may be either added or subtracted in a combination. This would 
be the case in weighting objects with a chemical balance (since an object may 
be put in either pan of the balance) and will be called the chemical balance prob¬ 
lem In the latter problem the design matrix may contain O’s, I’s, and — I’s, 
whereas in the spring balance problem the matrix may contain only O's and I’s. 

We shall use Hotelling’s notation. There are p objects with weights h , 
bj, ■ 6p to be weighed m N > p weighing operations. The design matrix 
IS denoted by 

(1) , X = ||a:..|| a = 1, -.-.Wji = 1, •••,?. 

Denoting the transpose of X by X', let 


(2) X'Z = !la.,|| = lla”ir 

(3) Q\ “ ^ 


where ^ a is the observed result of the a-th weighing operation, 
estimates of the b, are 

(4) &. = £a”'fir, 

and the variances of these estimates are 


(5) 


= a <T 


The least squares 


where o-° is the error variance of a smgle weighing operation. The a" will be 
called variance factors. 

Hotelling’s main theorem states that or any design, a“ > 1/N, hence the 
best possible design is one such the inverse of the product of the design matrix 
by its transpose has its main diagonal elements equal to l/N. We shall call 
such a design an optimum design. Examples show that optimum designs do 
not exist for all values of X and p. 
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When an optimum design does not exist, the question arises as to how a 
best design shall be defined. In the present paper a design will be called best 
if the determinant of the matrix jj o’-’ 1| is minimized. A best design in this 
sense is, therefore, a design which gives the smallest confidence region m the 
hi {i = 1,2, ,'p) space for the estimates of the weights. 

In certain situations, other definitions of beat designs may conceivably be 
preferred. Thus, problems may aiise in which one might prefer: 

(a) to minimize the variance factors subject to the restriction that they be 
equal, (b) to minimize some function of the variance factors, or (c) to minimize 
only a certain subset of the on a minor of the matrix H j] as nught be the 
case when one wanted only rough estimates of the weights of some of the objects, 
but accurate estimates of the others. 

When an optimum design exists, the confidence regions are not only minimized, 
but, as Hotelling has shown, the variance factors are also minimized. It is not 
true in general, however, that a best design as here defined (minimum confidence 
regions) wiU also mi ni mize the variance factors. Examples illustrating this 
point are given in the last part of section 6 and the first part of section 7. 

3. Hadamard Matrices. The problem of finding the best designs is closely 
related to the Hadamard determinant problem. Hadamard [3] proved the fol¬ 
lowing result: If the elements of a square matrix X are restricted to the range 
— 1 < ajaj < 1, the maximum possible value of the determinant of X is , 
and when this maximum is achieved all a:a|j — ±1 and the matrix is orthogonal 
in the sense that X'X is a diagonal matrix; the non-zero elements of X'X are 
all equal to N. A matrix X which satisfies these conditions will be denoted by 
Hif . Obviously if Hy exists for a given N, it is the solution of the design prob¬ 
lem in the chemical balance case when N = p. 

With regard to the existence of Hh , it is known that a necessary condition is 

N = 0 (mod 4) 

with the exception of A = 2. It is not known however whether the above 
condition is sufficient, although it is known (Paley [4]) that exists for the 
range 

0 < 4fc < 100 

with the possible exception of 4fc = 92. Paley and Williamson [5] give methods 
of constructing Hu, in the given range (excepting 92) based on the theory of 
finite fields. 

When A is a power of two, Hjv is easily constructed by taking direct products of 
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Sylvester [6] first studied, this class of matrices and Kishen [7] has described 
weighing designs based on this subset of the Hk . 

The following examples of Hadamard matrices may be found in the literature; 
Paley [4] exhibits an Ha , Hn , and ffjs: Kishen gives an Hu ■ From these 
examples Hu and H^i may be constructed at once from the direct products 
H 2 -Hu and Hi -Hu ■ The following is an Hio : 
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where the signs represent ±1. This example was constructed by Williamson’s 
method [5]. Thus examples of H^k for the range 4 < 4fc < 32 are immediately 
available and methods of construction exist for the range 36 < 4fc < 88. 


4. Chemical Balance Problem. When iV = 0 (mod 4) an optimum design 
exists a Hy exists and is obtained by using any p columns of Hy. When 
JV # 0 (mod 4) we may construct very efficient designs as follows: If W = 1 
we may add a row of ones to ; if W = 2 we may add two rows of ones or 
a row of Hi’s to Hif-i ; and if W = 3 we may delete one row from Hir+i. The 
worst of these designs will be obtained when two rows of ones are added to an 
Hy-i, and in this case the variance factors are 


1 W 4- 2p - 4 ^ 
lV-2iV + 2p-2"^W-2 


Since it is known that these factors must be greater than 1/N for the best 
possible design in this case, the above design will be quite near the best design 
for large N. 

For small values of N we shall consider only the case H = p, since if one 
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wanted to make N > p weighingH, he would normally choose to be a multiple 
of four because the gam in efficiency by using optimum designs is rather large 
for small N. In general more than p weighings M’oulrl be required because o-® 
is not usually known. Thus several additional weighings may be made in 
order to obtain several degrees of freedom for estimating cr‘. 

When Hi, does not exist we have already defined the, best design as one which 
minimizes the confidence region for estimating the weights; that is equivalent 
to maximizing j a,, | or minimizing \ a’ |. There may be. several designs with 
the same minimum, but we shall not give all of them. 1'hus when p = 3 the 
best designs arc 

+ +0 + + + I+ + - 

if=+-+, + — + and + — + 

— + + — + + — d-d- 

all of which liave yl = 10 (which ls considerably smaller than the value 27 
that A would have if an optimum design existed). Using the notation 

(a-‘) = (a^\ • • , a^, 

the first of the above designs for p — Z givoJi 

(a“) = ih h I) 

wliile the second and third give 

ia") = (i, h W. 

For N = p = 5, two best designs are 


d- 

“b 

■b 

d- 

— 

+ 

— 

— 

— 

— 

d- 

“b 

d- 

— 

d" 

+ 

d- 

"b 

— 

— 

d- 

-b 

— 

+ 

-b and 

"b 

— 

"b 

— 

d- 

d- 

— 

+ 

d- 

-b 

+ 

— 

-b 

+ 

— 

— 

+ 

-b 

"b 

-b 

“b 

d- 

— 

d- 

-b 


both of which have 

A = 3'2® and (a”) = (2/9, 2/9, 2/9, 2/9, 2/9) 
For iV = p = 6, a best design is 

d-- 

d--d- d- 

d- — —d-d- — 

-b-d--b-d- 
-b d- d- - -b - 

d- d- - d- - -h 

which has 

A = 5'*2''' and all a'* = 1/5. 
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For = p = 7, a beat design is 


+ 

— 

— 

— 

— 

— 

— 

+ 

+ 

+ 

+ 

— 

— 

— 

+ 

+ 

— 

— 

— 

-b 

+ 

+ 

+ 

— 

— 

+ 

+ 

- 

+ 

— 

— 

+ 

+ 

— 


+ 

— 

+ 

+ 

— 

+ 

— 

1 + 

— 

+ 

— 

+ 

— 

+ 


which has 

A = and all a“ = 1/6 

These designs were constructed by a method due to Williamson [8] which 
will be described in sections 5 and 7 It is interesting to note that no minor 
of an Hi is a best design for = p = 7, for any minor of an Hi gives 
A = 2'® < and all a” = \ 

6. Spring Balance Problem. W = p = + 3 When N = p and A'' = 3 

(mod 4) the best possible design for the spring balance case is determined by 
Hff+i if it exists Let K^+i denote a matiix formed from by adding or 
subtracting the elements of the first row of H^f+i from the corresponding ele¬ 
ments of the othei lows in such a way as to make the first element of each of 
the remaining rows zero Obviously 

I Hff+i I = ± I Hif^i I 

and excepting the first row, the elements of Ky+i are 0 and ±2 with the signs 
of the non-zero elements the same for elements in the same row. Let be 
the matrix obtained by omitting the first row and column of Kif+i , by changing 
all non-zero elements to -f 1, and by permitting two rows if necessary to make 
the determinant of Ljv positive. Then 

I Hjv-Hi I ~ 2*^ I Lw 1 

and it IS clear that, given Lv, one could reverse the procedure and determine 
an Hff+i. In the same manner, there is a correspondence in general between 
square matrices with elements ±1 and square matrices of one less order with 
elements 0 and 1 The ratio of the values of corresponding deternunants is 
always 2^ if their determinants do not vamsh, hence the 0,1 determinant will 
have its maximum value when its corresponding -f-l determinant has a maxi¬ 
mum value. Thus 1 1 is the maximum value possible for a determinant of 

O’s and I’s of order N, and the value is 

(7) I 1 = (AT -t- 

The variance factors are 


= m/{N -i- 1)“. 
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We knew in advance, of course, that the a* would be greater than l/AT since 
an optimum design cannot exist unless the design matrix has its elements equal 
to dzl, and we must here restrict the design to have only 0 and 1 as its elements. 
Since is a best possible design for the spring balance case, it follows that 
designs for the spring balance problem can be no more than about J as efficient 
as designs for the chemical balance problem. 

6 . Spring Balance N > p. When W > p the device used in the chemical 
balance case to get optimum designs cannot be used. For if we select p columns 
from an Lit we may get rows of zeros which would waste weighing operations. 
A different approach is necessary and a clue is pven by the designs . In 
these designs p is odd and the objects are weighed + 1) at a time in each 
weighing operation. We shall show in general that objects should be weighed 
^(p + 1) at a time when p Is odd, and we shall obtain a corresponding result 
for p even. 

Let P, be a m itrix whose rows are all the arrangements of r ones and p — r 
zeros (0 < r < p). (The s 3 nnbol should also have a subscript p but that is 
omitted because any specific value for p will always be clear from the context.) 

The matrix will have p columns and rows. Let Q be a matrix made up of 

matrices Pr arranged in vertical order. Let n, be the number of times Pr is 
used in constructing Q. Q is a weighing design for p objects and 



weighing operations. The matrix Q'Q will have diagonal elements 

(9) a = 
and non-diagonal elements 

( 10 ) 

The determinant of Q'Q is 

A = (a - 6)’’"‘[a + (p - 1)5 
and we may write A in the form 

A = c’’“*d 


where 

( 11 ) 


c = o — 5, and d = a -f (p — 1)5. 
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We shall prove the following theorem: 


If p = 2fc — 1 where k is a positive integer, and if N contains the factor 



then A will be maximized when nk = N 



and all other n, = 0. 


We shall demonstrate this statement by showing that if any n, (s 9 ^ fc) is 
decreased and nk is increased in such a way that N remains unchanged, then A 
will be increased. Let n, be reduced by an amount m so chosen that 


m! = m. 



is an integer; we may then increase njt by m' leaving N unchanged. It is readily 
found that these changes in n, and nj, produce the following changes in c and d: 


Ac = m 



Ad 



{k — s){k — s — 1) 
P(P - 1) 

I (fc — s){k + s) 

P 


both of which are positive on zero when s < k and A is necessarily increased. 

When s > k, Ac is positive but Ad is negative and it must be shown that the 
net effect of these changes is to increase A, we shall assume now that n, = 0 
when r < k. 


AA = (c + Acy~\d + Ad) - c^~"d < [c”"* + (p - l)c’’"'Ac](d + Ad) 

- c^'^d < c’’“"[cAd + (p - l)dAc + (p - l)AcAd] 


where in the second line we have omitted terms in Ac of higher order than the 
first. These terms are all positive since ^11 their fetors are positive. The 
bracket in the last expresaon on substituting from (9), (10), and (11), may 
be reduced to 


m 




1 


+ P(^)(fc-s)\* + a)(*-« - 1)]. 


and then to 


“ ©fe”' (r = 

+ PQ(fc-«)“(* + «)(*-»- !)]■ 

Each term of the sum in the bracket is greater than or equal to zero when fc > 1, 
r > k,s > k since the fraction is readily seen to be negative or zero under these 
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circumstances. The fraction vanishes only when k = 2, r = k, s = k+ 1. 
The other term in the bracket is negative but it is dominated by the term in 
the sum for which r = s, as may be shown as follows; The two terms in ques¬ 
tion may be written 




s(k — s) k — 8 + 1 


p - 1 




— l\(k — s)^{k -H s)(k — s — 1) 


and since n, > m, this expression is less than or equal to 


(s - [ 


s(k — s) + fc — s + 1 , — s‘)(k — s — 1)' 

---- 

p — 1 ps 


which is positive for s > i: since the bracket is negative as may be seen by 
factoring out and putting the result in the form 

(k — s + l)(s“ (p — l)fc*) — pfc(p — s) + (2s -f l)(k — s). 

Thus A4 has been shown to be positive and the theorem is proved 
The above argument has shown that Pt or repetitions of Ft, give more efficient 
designs than any other combination of the designs Pi, Pi, • -, Pk The ques¬ 
tion now arises as to whether these are the best possible designs. We shall 
show that they are by considering the matiices of section 6 which are known 
to give the greatest efficiency in the spring balance case. Let p = 4i + 3 

and let N = (21 + 2 )’ and suppose Lp exists (i.e. H^+i exists). Using P^ 

as the weighing design we find the an are 

att = 2N(t + l)/p 

a,j = N(i -1- l)/p i 7 ^ J. 

A single application of the design L, gives 

Out = 2(f -j- 1) 

Oij = t 1 i j 

and N/'p repetitions of L, gives an o,-, matrix with elements equal to N/p 
times the given elements for one application of the design. The two designs 
are therefore equivalent and Pi, is a best design. 

The variance factors for repetitions of the design Ph are 


a" = - —^— 
N{p- 1)® 


N = 0 Mod K 
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and these are nuninmm variance factors* as may be shown by an argument 
entirely analogous to that used in proving the theorem. Thus Pk is a design 
which not only mimmizes the confidence region for estimating the weights, 
but also minimizes the individual variance factors. 

Efficient sub-matrices of the Ph have not been studied except for small p, 
but we may point out that square sub-matrices of order p which are as efficient 
as Pk do not exist unless exists, for by the argument of section 4, it is pos¬ 
sible to construct from such sub-matrices Hence we cannot obtain vari¬ 
ance factors as small as those given by equation (12) when = p unless Hp+i 
exists. 

The situation here is analogous to that in the chemical balance case. By 
a proper selection of N we can obtain a design with the maximum possible ef¬ 
ficiency for any odd value of p. But here we are much more restricted in our 
choice oi N. In the chemical balance case N could be any multiple of 4 for 
which an Hk existed; in the present case N must be a multiple of p even in the 
moat favorable instance (p = 4< -f 3), and for some values of p it may be neces¬ 
sary that iV be a multiple of 

We now turn to the case in which p is even. The theorem corresponding 
to the one given at the beginning of this section is: 



If p = 2k where k is a positive integer, and if N contains the factor 



then A mil he maximimized when 


if/(J + J) 


and all other rir = 0. 

We shall not prove this theorem in detail. By arguments analogous to those 
used earlier, it may be shown that A is increased when either n, (s < fc) is de¬ 
creased and Tift is increased, or n, (s > fc -f- 1) is decreased and nj,+i is increased 
with N fixed. This done, we may put all rir = 0 except nk and nk+i and then 
maximize A with respect to these two variables subject to the condition that 


Uk 






= N. 


The values of Uk and nk+i which maximize A may be found by treating them as 
continuous variables and using the calculus. 

The variance factors for these designs are 


(13) 


4 p 
Np^^ 


N = 0 mod 



’ The author is indebted to a referee for suggesting this property of the design, and 
for several other valuable suggestions and corrections to the paper. 
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but these are not minimum variance factors. In fact one can obtain smaller 
variance factors than these by using only Pt in the design (omitting P*+i en¬ 
tirely) . In this case 

(14) a" = 4 ~ — IV s 0 mod 

N 

and 



(p - 1)" -f 1 p 

p2 p + 2 


when p > 2. 


We have not found explicitly the design which minimizes the variance factors 
for p even, but it appears that the design would be made up largely from P* 
with a small proportion of the design devoted to P/t+i . Thus (14) is very 
nearly the minimum possible variance factor. 


7. Spring Balance Designs for Small p. When p = 2, each object may be 
weighed r times by itself, and the two objects may be weighed together s times 
to give 


II a.. 11 


r + s s 
s r -f s 


and if A is maximized subject to 2r -j- s = W we find 


r = s = iV/3 


a" = 2/N 


provided JV is a multiple of 3. The most efficient basic design is therefore 


X = 


1 1 
1 0 
0 1 


in accordance with the previous section. When N is not a multiple of 3 the 
best design is obtained by using the first row of X for the odd weighing when 
iV = + 1, and the last two rows when iV = 3^ + 2. 

The case p = 2 is notable in that there is almost nothing to be gained by 
weighing the objects in combination. For the variance factors 2/N would 
be obtained by simply weighing each object separately N/2 times. The ad¬ 
vantage of weighing in combination is only that square confidence regions in 
the hi, hz space are replaced by ellipses with somewhat smaller area. If a” = 
(r -t- s)/(r“ -h 2rs) is minimized subject to 2r s = JV, we find 

r = JV(3 - V3)A o’*' = 1.866/JV 

so that the a’* are reduced slightly from 2/N but at the expense of increasing 
the area of the elliptical confidence repona. 
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For p = 3 tlie most efficient design when iV = 3 is 


X = 


1 1 0 
10 1 
Oil 


as given by Lj or Pi. It is easily shown that for N > S, the most efficient design 
is given by repeating X even when N ^ 0 (mod 3). Thus for iV = 4 we would 
repeat one row of X, for JV = 5 we would repeat two rows of X, and so forth. 
The variance factors are 


a 


It 


_9_ 

iN 


AT = 3< 


m + 1 ) 

4(iV - l)(iV + 2) 

m +1) 

4(iV - 2)(iV + 1) 


AT = 3< + 1 


AT = 3« + 2. 


For p = 4 we may attempt to find by trial and error a sub-matrix of the 
design given by using Pi once and Ps once, but this would be a tedious process 
and the labor would soon become prohibitive for larger values of p. Hence 
another method must be found for obtaining the best designs when N = p 
except when Lp exists A method is provided by Williamson [ 8 ]. Let Dp 
be the best design for N = p. Williamson shows that when p < 7, I>p_i is 
a minor of Dp, hence Dp may be found by adding a row and column of variables 
tp Dp-i and expanding the determinant of the result by the bordered expansion. 
For small values of p it is easy to determine by inspection what values the 
variables should have in order to maximize the resulting expansion William¬ 
son determined Di and D^ by this method 
There are two types of D 4 which give a maximum value of A = 9 


1110 


10 0 1 

110 1 

and 

1110 

10 11 


0 0 11 

0 111 


0 10 1 


The variance factors are all 7/9 for the first of these, and for the second 

(a") = (7/9, 7/9, 7/9, 4/9). 

When N = 5, p = 4:, there are a number of designs which give a maximum 
A of 19. None of these however has all a” equal, and we shall give only one 
example: 


1 

0 

0 

1 

1 

1 

1 

0 

0 

0 

1 

1 

0 

1 

0 

1 

1 

1 

0 

0 


X = 
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which has 


(a“) = 


\19’ 19’ 19’ 19/’ 


When N = Q, there appears to be no design superior to Pj. It has variance 
factors all equal to 5/12 and A = 48,—a very large gain in efficiency over 
N = & &t the expense of one additional observation. 

When p = 5 there are three types of Di which give A a maximum value of 26, 
none of which has all variance factors equal. An example is 


with 


0 

0 

0 

1 

1 

0 

0 

1 

1 

0 

0 

1 

1 

0 

1 

1 

1 

0 

1 

0 

1 

0 

1 

0 

1 



w w w n 

25’25’26’26’26/’ 


For p = 6, an example of a Da with all a*‘ equal which maximizes A is 


1 

1 

1 

0 

0 

0 

1 

0 

0 

0 

1 

1 

1 

0 

0 

1 

1 

0 

0 

0 

1 

1 

0 

1 

0 

1 

1 

0 

1 

0 

0 

1 

0 

1 

0 

1 


with A = 81 and a" = 11/21. This example was constructed by the bordered 
expansion method from Df and it turns out to be a sub-matrix of Ps. It is not 
as efficient as Ps, however, since substitution of iV = p = 6 in equation (14) 
gives a" = 13/27, Hence we have shown that there does not exist a minor of 
Pj (for p = 6) of order 6 which is as efficient as Ps itself. 

For p = 7, there is a most efficient design given by Ln . 


1 

0 

1 

0 

1 

0 

1 

0 

1 

1 

0 

0 

1 

1 

0 

0 

0 

1 

1 

1 

1 

1 

1—1 

0 

0 

1 

1 

0 

0 

1 

1 

1 

1 

0 

0 

1 

0 

1 

1 

0 

1 

0 

1 

1 

0 

1 

0 

0 

1 


with A = 2“ and all o“ = 7/16. 

Dp for p = 8, 9, and 10 could presumably be constructed from D? in the same 
way and the designs for p = 4, 5, and 6 were constructed from D», but the 
computations become very tedious for these larger values of p. 

The designs given in section 3 were constructed from the above designs by 
the method described in section 4. 
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8. Bias in Measuring Devices. In some kinds of experiments it may be 
necessary to estimate a bias in the measuring scale m order to estimate the meas¬ 
ures of the objects Such a bias may amply be regarded as an additional 
object to be measured except that it is an object which must be included in all 
the measuring operations. In the chemical balance case the bias presents no 
difficulty, for if an Hat exists, then there exists an with a column whose ele¬ 
ments are all -f-1 • Such an may be constructed from any given Hs by merely 
changing the signs of all elements m rows which begin with a minus sign. The 
result will be an Hk with -|-l’s in the first column and that column may be 
assigned to the bias We note that the gain m efficiency by measuring objects 
in combinations is even greater in the case of a biased measuring scale than when 
there is no bias For if the objects were measured separately, their measures 
would be estimated by the difference of two scale readings and would have vari¬ 
ance 2(r“; hence the variance factors a" are to be compared with 2 (rather than 
1) in the case of bias. 

In the spring balance case, the additional restriction that all the elements of 
one column be one necessarily reduces the efficiency of the designs in the sense 
that the variance factors for p objects and a bias will be larger than the variance 
factors for p -f 1 objects without bias. When the measures of p objects and 
a bias are to be estimated from W = p + 1 measuring operations, a best design 
may be obtained by adding a row of zeros and a column of ones (in that order) 
to the best design for iV = p without bias This can be seen by recalling that 
there are two determinantal exressions for the volume of a simplex with one vertex 
at the origin in a Euclidean p space (A simplex (Sommerville, [9]) is a polytope 
with p + 1 vertices bounded by p -f- 1 (p — 1)-dimensional hyperplanes) The 
determinant of the best design for iV = p (without bias) is proportional to the 
volume of the largest simplex with one vertex at the origin and the other vertices 
restricted to be selected from the vertices of the umt cube. A determinant of 
order p 1 with a column of ones and the other elements zero or one also gives 
the volume of a simplex with vertices selected from the vertices of the umt cube. 
Hence the two determinants (one of order p and one of order p -f- 1) must 
have the same maximum value, and as one of the vertices may be selected ar¬ 
bitrarily in the case of bias, we may select the origin. 

In general, for N > p, similar geometrical reasomng will show that the best 
designs for the spring balance problem in the case of bias are easily constructed 
from Hadamard matrices as described in the followmg theorem: 

If X is a best design for ihe chemical balance problem in ihe case of bias and if X 
contains a row of -f I’s, then a best design for the spring balance problem in ihe 
case of bias is given by replacing the — I’s inXby zeros. 

We have seen that the best design in the chemical balance case is obtained 
from a Hadamard matrix 'with a column of -fl’s. Obviously the matrix may 
be also made to contain a row of + I’s by changing the signs of certain columns. 
The design X consists of the colunm of ones together -with any other p columns 
The determinant of X'X is 1/p'* times the sum of squares of the volumes of 


a set of simplexes in a p space. 


There are 



of these simplexes deter- 
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mined by the different combinations of the rows of X taken p + 1 at a time, and 
the coordinates of their vertices are the last p elements of the rows of X. The 
vertices are therefore selected from the vertices of a cube in the p space which 
has its edges parallel to the coordinate axes, the origin at its center, and the 
lengths of its edges equal to two. Since X is a best design, the vertices are 
selected so as to maximize the sum of squares of the volumes of the simplexes. 
Now in the spring balance case we must maximize the sum of squares of the 
volumes of a set of simplexes which have their vertices selected from the vertices 
of the unit cube. Obviously this may be done by selecting vertices correspond¬ 
ing to the selection given by X. Thus it is necessary only to set up a cor¬ 
respondence beteen the vertices of the two cubes Since X contains the vertex 
(1, 1, 1, 1, ••',!) which is common to both cubes, the natural correspondence 
which identifies a vertex such as (1, — 1, — 1, 1, — 1, 1, ■ ■ ) with (1, 0, 0, 1, 
0, 1, ■ • •) may be used. 

The variance factors for these spring balance designs are 4/N (for any p <N) 
when N is a multiple of four and exists; when N is not a multiple of four 
and modifications of Niv as described in section 3 are used, the variance factors 
will differ from i/N by terms of order ^/N^ 

9. Addendum. After this paper was written, the paper of Plackett and 
Burman on “The Design of Multifactorial Experiments” appeared m Biomeirika. 
Volume 33 (1946), pages 305-325. A part of this paper discusses Hadamard 
matrices much more completely than we have done in section 3 In particular 
Plackett and Burman have constructed all Hadamard matrices of order less 
than or equal to 100 (excepting 92). 

REFERENCES 

[1] Habold Hotelling, “Some improvementB in weighing and other experimental tech¬ 

niques,” Annals of Math. Stal , Vol 15 (1944), pp 297-306 

[2] F Yates, “Complex experiments,” Jour Roy Slat, Soc Supp , Vol 2 (1935), pp. 181- 

247 Reference is to page 211 

[3] J Hadamard, “Resolution d'une question relative aux determinants," Bull, des Sci 

Math (2), Vol. 17 (1893), Part 1, pp. 240-246. 

[4] R E A C, Paley, “On orthogonal matrices," Jour, Math and Phys,, Vol, 12 (1933), 

pp, 311-320 

[5] John Williamson, "Hadamard 's determinant theorem and the sum of four squares,” 

Duhs Math Jour,, Vol 11 (1944), pp. 66-82 

[6] J. J. Sylvester, “Thoughts on inverse orthogonal matrices,” Phil Mag. (4), Vol. 34 

(1867), pp. 461-476. 

[7] K. Kishbn, "On the design of experiments for weighing,” Annals of Math Slat., Vol. 14 

(1946), pp. 294r-301. 

[8] John Williamson, “Note on maximal determinants,” Ato Math Mon., Vol 63 (1946), 

pp 222-224 

[9] D. M. Y. SoMMEHViLLE, Introduction to the Geometry of N Dimensions, London, Methuen 

and Co., 1929 



THE APPROXIMATE DISTRIBUTION OF STUDENT’S STATISTIC 

By Kai-Lai Chung 

University of Peking, Kunming, China 

Summary. It is well known that various statistics of a large sample (of 
size n) are approximately distributed according to the normal law. The asymp¬ 
totic expansion of the distribution of the statistic in a series of pothers of n~^ 
with a remainder term gives the accuracy of the approximation. H. Cramer 

[1] first obtained the asymptotic expansion of the mean, and recently P. L. Hsu 

[2] has obtained that of the variance of a sample. In the present paper we 
extend the Cram6r-Hsu method to Student’s statistic. The theorem proved 
states essentially that if the population distribution is non-singular and if the 
existence of a sufficient number of moments is assumed, then an asymptotic 
expansion can be obtained with the appropriate remainder. The first four 
terms of the expansion are exhibited in formula (35). 

1 . In a fundamental paper^ P. L. Hsu [2] has devised a method for obtaining 
the asymptotic expansion of the distribution of various statistics. The present 
paper deals with the so-called Student statistic. 

Let 


£l > ^2 > ■ ■ ■) fn 


be n independent random variables having the same probability distribution 
represented by a distribution function P(z). The rth moment and rth absolute 
moment are denoted by ur and Pr respectively. It is assumed that aj = 0 
and that for a certain A: ^ 3, and that a 2 > 0. Hence there is no loss 

of generality in assuming that a 2 = 1. 

Student’s statistic is defined as 


n \-i 

E («r - m 

r«l I 

, n(n-l) / 


where I = - E fr • 
n 


For brevity, we consider 

"td; (f. - ()’)''■ 


Let its distribution function be denoted by F(z), i.e , 

(fr - ^ zj = F(z). 


1 The definitions of the various constants A, A^ , Qk , , t>, @, 0* , are the same as 

in Hsu’s paper. 
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Discarding the case k = 3 where we can prove a more precise result and the 
singular case which can be shown to admit no asymptotic expansion in. the sense 
of Cram6r [1], we shall prove in this paper the following theorem: 

Theorem. Let P{x) be non-singular and mk < °° for some integer it ^ 4. 
Then 

(1) P{z) = Hz) + x(z) + R{Z), m = dy, 

where x(^) w a linear combination of the derivatives • ■ •, vnth each 

coefficient of the form ^ v ^ k — 3) times a quantity depending only on aj, 
■ ■ •, afe_i whose beginning terms are given in (35) and where 

(2) I Riz) I g Q,a + 1 2 = 

where Q* ts a constant depending on k and Fix) / 

We shall need some of Hsu’s lemmas, i.e., his lemma 3, lemma 7 (both for 
the particular case m = 2) and lemma 8. These we shall quote with this num¬ 
bering. The application of Hsu's method to Student’s statistic depends on the 
followmg lemma. 



2. Lemma A. For u ^ — L Z ^ 1, we have 


1+ v_ 


D_ _ (1+’g 

r(j + i) \ r(|-j)r(j + 1) 


^ Vi + « g 1 + E - 7 s —^ - 

^(1 "■ ^’) 

Proof. By Taylor’s expansion of ■y/l u, we have 



whence it follows that (1 + is finite, and positive. The right- 

hand side inequality follows. 
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Similarly, if u ^ 0, 


2J-1 

vtt^ & 1 + i: 


il) 


i 1+ s 


<0 


U> + 


'© 


a-) 


T(2l + 1) 


’■‘r(|-j)r(, + i) 


21-1 (—i)'r 


since by a well-known result on the binomial theorem we have 


For — 1 ^ u < 0, we have 


= vT^ = 0. 


r(i +1) 


SI-1 

1+r 


(D 


For — 1 ^ u < 0, we have 

(I) 


, N 

tt - VI + M = ^ , say. 


21-1 

f) = 1 + E 




+ \/i -f- 


u 


21-1 

^ 1+ E 




Next, 


r(|-i)r(i + i) 


21-1 

N = ii+2:. 


"r(|-i)rO+i) 

is a polynomial in u of the form 

U (qq “t~ ®lW "1“ • * • 4” ChlU^^} 


u’ \ - (1 -1- n) 


where Oo > 0 and the successive coefficients have alternating signs; hence for 
— 1 < u g 0, flo + OiU + • • ■ -h a 2 ju** assumes its maximum at u = — 1. This 
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maximum is obtained by putting u = — 1 in the numerator, hence for — 1 § 

M < 0, 


21-1 (^1) 
1+ E-75-; 


^“r(|-i)r(i+ 1) 


The left-hand side inequality in the lemma now follows. 

For brevity we write the inequalities as 

(3) 1 -f PjjCw) = 1 -f Pii-i(u) — hjj ^ -s/l -b u ^ 1 -b Pjj_i(u), bn > 0. 


3. We write 


E (ir - f)^ = E fr - nf = n -b Vn(ai - 1) X ~ Y\ 


where 


x = t^i==L= y = Vn?. 

r-l V ~~ t) 

Then Student’s statistic may be written as 

ni (I; tt, - {)f - y (i + x-ty. 

Then, for every z, we have 
F(z) = Pr -b X - Y^y ^ z| 


For brevity let 


yi + jr-y, 

Suppose z ^ 0; then we have by (3), 


04—1 


X = U. 


2 + zp 2 i-iiU) ^ z vT+T ^ 2 -b zPniU) 

Pr{V ^ z -b zPn^iiV)] ^ F(z) g Pr {7 g z -b zPs,(t/)} 

Suppose z > 0; then we have by Lemma A a similar inequality with the 
extreme terms exchanged. 

Now we take I = fix it henceforth. 



student’s statistic 


451 


Our next step is to obtain an asymptotic expansion for 
Pr{V g z + zP™(C/)} = F/-|y < z (l 


with m = 2Z — 1 or 2Z, Z ^ 1 . 
Let b be any real number, and 




Until section 12, we shall write simply L{x) for either of the Lmix). 

4 . Let W be the probability function of the distribution of the random point 
(X, Y) and let/(ti, fe) be the characteristic function. 

W(S) = Fr{(X, Y)eS} for every Borel set /S in SI 2 

M, in'. 

i(lCa4—1)"^®*— 


p(ti , fe) = f 

a 




Then 


( 5 ) Pr{Y £b + L(X)} = Jf ^ JJ + H H 

where 


vS!>+r(*) 


VST> 


G{x, y) = < -1 
. 0 

We approximate G{x, y) by H{x, j/), where 


if b < 1 / g 6 + L{x), 
if b + Lix) < y ^ b, 
otherwise 


H{x,y) =^-e-" 

. 0 

We approximate dW by iw{x, y) + y{x, y)) dx dy, where 


il b < y ^ b + Lix) 
if b + Lix) < y ^ b 
otherwise 


— 00 •'—so 
j* 00 i* so 


yix,y)= r re-*‘‘*-^’'’>(<i,Z 2 )iA(i<iVZ 2 )dZidZ 2 

•Loo •'-00 

««,,« - p - E (- 7 ^=) - 
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and f( 7 ij, iu) is given in Lemma 3 by taking therein 

being any of the f.’s. 

We write 


e ~ 1 

a/q!! — 1 ’ ( 


( 6 ) 


f f 0(x,y~u)dW 

*A~0O tf—tc 

/ “> r* 

^ {Gisc.y -u) - H(x,y - u)) dW 


L L ~ ~ “)) + yix,y)) dy dx 

/ a> -m 

/ fJ(x,y - u) dW 
00 ^->00 


~ /- L + 7(a;,2/)) dy dx 


/ •o »« 

w L„ “ '^{^ix,y') + y(x,y)) dy dx 


5. We have 

I y -u) ~ H(x, y - a) I S 1 - g «x*' 

I L L H{x,y - u)) dW I £ £ dW = eS(X’‘) g Q*e 

since 


where Q* depends on as, • ■ •, aj*. 

Similarly, 

I /- / « ~ - tt))(i^(a;,y) + 7 (a;,y)) dy dx 

Next, 


^ Qkt 


L L ^^^’y ~ + 7(®,y)) dydx 

(10) = / / ^'^(=^>y)+yi==,y))dydx~ J j (y,{x,y)+y(x,y))dydx 

wWe the first term on the right-hand side, regarded as a function of n-* has 
a Taylor expansion m powers of n-, whose first few terms we shafi comoute ^ 
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section 9; for the present let us denote it byB(u + 6) + C(u + where 

C = C(u-jr h) is Sb constant depending on k, P{x) and z, a more explicit estimate 
of which will be given in section 10. 

Further, we have 

(11) r r f J dW- f JdW 

V^u+b+L(x) l/gu+i 

by Cramer’s asymptotic expansion for the mean ■%/ nY, and as is also shown 
in Hsu’s paper we have 

(12) / J dW ~ J J (w(x,i/)+y(x,y))di/dx=Ain~^^'^''^ 

Collecting all the results from (5)-(12), we get 

j J dW - B(u-i-b) - C(u +6 

Vg u+b+L(x) 

= + £ £ H(x,y - u) dW 




H{x,y - u)iw{x,y) +yix,y)) dy dx 


Now we use A. C. Berry’s weighting factor and obtain 




£ 1^ J J dW-B(u + b) - du + 6)^ du 


V^U+b+I>(«) 

= AbTie + 


(13) 


r 1 - cos ^ /£“ 

JL« Vr j-eo 

-LX H{x,y — «)(w(x,y) + y{x,y)) dydx'^du 


since 


rLL£^du = .T. 

JL« v,^ 

6. To transform the triple integral on the right-hand side of (13) we use the 
Fourier transform as Hsu did. 

Let 



'‘t'lfjix, y) dy dx = h(ti,tt ); 
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r r 

DQ immOQ 


-itix-UiV 


r r - u) dy dx = e-^‘"‘h(tx, k) 

J—00 d— -71 

""(Z ^ Hix,y ~ u) du^ dy dx 


•^60 U Q 


_ f’lCr I h I )^(ii ,is) t/1 fa 1 '< T 

\o otherwise 

ff(r - I fa I) if I fa I < r 


otherwise 


By Fourier inversion we have, almost everywhere, 


— cos Tu 


H{X, y-u)du ^itl^+ihv^rp _ I ^ 


Hence 


ri ~ COSTW f" r rr/ s , 

I - -2 - / / J^(a:, y ~ n) dlF dw 

*^oo U J—q 9 */—00 

~ ^ {T I fa |)/(fa, fa)h(fa, fa) dtidtx 


Similarly we obtain 


/'°° 1 — COS Tu f f°° f“’ \ 

jL„ ^^2 -jL„ “ m)(w(®, 2 /) + T(a:, y)) dy dx) du 

r T ^ 

~ ^ L„ Lt ~ ^ ^ Di^Cfa > fa) (1 + 4'i‘^ti, ffa) }/i(fa, fa) dfa dfa 

From (14) and (15) we obtain 

f" 1 - cos / r r 

L —— \L L 

~ J L y ~ '“)('“’(*! y) + 7(®j y)) dy dx] du 

(16) “ ” f 

— </>(ti, fa)(l + ^(iti j ifa))};i(fa , fa) dfa dti. 

7. To estimate the double integral on the right-hand side of (16) we break 
it up into parts and use the following estimates of h(ii, fa). 


Lemma B. We have for I = I— I ^ 
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( 1 ) 

( 2 ) 




/-I 


h{k, h)\ ^ Qktfe Ni I h I, rr'‘\ 


where •A/'( ] |, is a polynomial with constant coefficients in the indicated 

arguments. 

Proof. 

I hih , fe) I = ff e-'**-'*"-'**' dydx- f j dy dx 

KvSi+iC*) Ii+l(3i)<»SIi 

= (f - f f ) dy dx 

\JHx)-^oJb J Liz)<e Jh+L(x)/ 

= r - 1) dx. 

J-flO 


Hence 


Since 


we obtain 


\h{h,t2)\ g l^r r \kL{x)\e-‘^’' dx. 

•L-oo 


21 


I L(x) I g 12 I IZ (“4 - 1)^' n“*' I X I 

J—1 


h{ti, fc) 1 g l2 1 23 («< - f U I' 

J-1 


e-““ da: 


Next, we write 


j-i 


h{ti, <2) = (-1/2) u''(x)v(x) dx 


with 

= fi-’*’", v(x) = - 1 ). 


Integrating by parts twice, we get 
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whence 

j—CO 

+ eV'-' 1 i(®) 1 + 1 11 L’\x) 1 }dx ^ 2 1 + 2 ^)Ar(l h I, n\ r'‘) 

The lemma is proved. 

Now we write 

f" lfcl)(;-.^(l +V')lAdfcdti 

JLqq J-f 


(17) 


= II If II + + 

MlliS<3*"i |lll>0*n» |ll,IS0*t.» 


On Ii we use Lemma 3 and Lemma B, (1): 

lLi^(3*l2l 

w—«o '^‘00 ^ 


•|e (1 «.• r + • • • + u. dh di, 


11 


J -.1 


On I 3 we use Lemma 7 and Lemma B (2), Since 1 h 1 > Q*w*, 1 4*(1 + if') 1 ^ 
e”"*’*, and by Lemma 7, p(<in“*, /jn~*) = e"®* so that 1/(4, fe) 1 S \f — 
$(1 + I ^ e'"”* 


h^Qiz' jj rtr® e-®*jv(I fc I, n-*, r''“) * dh . 

|li|>a*nt 

Let € = n~^, /3 > 0, then it is evident that 

]Ii\S QifcZ*. 

Similarly using Lemma 7 and Lemma B, (1) on U we see that 

1 /. I ^ Q* 1 2 I. 

Therefore 

4 t I Ir ~ ^ f* l){/(fi» ■" > f*)(l ■)' » *k))}h(ti, 4) dk dii 

^ Q* ( 1 21 + 121 
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8. Combining (13), (16), (17) we obtain 


(19) 


L - -( If <^W-B{u + b)- C{u + du I 


k£u+Ii ♦-£(*) 


^ Qk (re + + 121 4- 2® + IZ I E . 

Now w§ shall choose T and « suitably. Ijet 

T = n“, e = n"^ a > 0, |3 > 0. 

To make the right-hand side of (19) a constant depending on z only, we must 
have a ^{k — 2) ‘ S a- Then 

^ ~u+i)ni _ ^ 

j-i i-i 

We must choose < k/2, then 

J-1 

To make the exponent as small as possible we choose P = a, then 
1 2 1 i: n-*' ^Ak\z\ = Ak\z\ 

j-1 


since a is to be as large as possible, we choose 

. (k-2 (Jc- 1)Z\ 
a - 0-0 - mm^ ^ ' 2{l1)}' 

Then we obtain 

£“ 1 - ^ II ^_2[u + h)- C(u + 5)n-*'‘-«^ 


=[a- 


du 


( 20 ) 




^ Q*(l + 2*)- 


Let F*iu) be the distribution fimction of Y — L{X), and let 
Fi(u) = B{u) 4- C(u)n"‘'*"*’ 

Then we may write (20) as 


( 21 ) 


1“ 1-CO^ 


^ Q*(l 4- 2*). 


By the definition of Fi{u) we see that the conditions in Lemma 8 are all satisfied 
with a certain constant D depending on k, P(x), and z for the M therein. Then 
choosing b to be the a in Letnma 8, we obtain from Lemma 8 and (21), 


( 22 ) 


Draja ^ d® - irj g <2*(i 4- 2 ’) 
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where 


a = Ai.u.b.lF*(M) -TO|. 

Now there exists A such that if T5 > ^, then 

/.n 


„ f 1 — cos a: , . . 

3 / -; ax — IT > 1, 

Jo 


hence it follows from (22) that 

Tfi ^ max {A, + z*)). 

Thus for another Qu exceeding both A and the above Qk , we have 

n ^ Qi(i -1- z“) 
and so finally, dropping the prime, 

(23) I F*{u) ~ Fkiu) I ^ Q*(l + z*)Dr“' = Qt(l + z^)Drr'*\ 

In particular, taking 6 to be z(l + n~V)“* = z', say: 

(24) Fr[Y ~ L{X) g z') = -8(2') + ^(zOn"^^"’^ + At(l + z^)I>n‘ 
where 

B(z') + C(z')n"*^~^' — the Taylor expansion with a remainder of 
J j (u)(x, y) + y(.x, y)) dy dx 

V— 

and D is an upper boimd for 

1 B'(u) + C"(«)n-*'*-*’ 1. 

9. Let X = n~*, and rewrite the z' + 2^i_i(x), I ^ 2 there as ff(X): 


gO^) 

Then 


= z' (1 + 


(«4 — 1 ) 


1/2 


■ Xx 




J/(0) = z' 
?'(0) = 


g"iO) = - 2 '** 


£/"'(0) 


_Z 1 -'- 
4 

_ 3(^ - 1)*'^ ,, 
8 


z'x*. 
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5P+V 


Let p g Oj 3 ^ 0-,Wp,(x, y) = where w{x, y) is defined in section 

4 and we know that 


—(**--2pxv+i/*)/2(l—p*) 


Let 

(26) 

Then 


"’<*■ S'’ ■ wr=T=' 

a aO) 

iOp,(x, y) dy dx. 

so 


(27) 


/;,(x) = r 

J—CO 

/';«(x) = 

J—CO 


pq (x, g(\)) dx 

ig"(\)Wpt{x, ff(X)) + g'“(X)iiJ,,(,+i(x, ^(X))) dx 


fp'A) = (g'"(^)Wp,(x,gi\)) + 3i?"(X)3'(X)tn,,,+i(x,ff(X)) 

+ 9 '^M‘Wp.t+tix, gi\))) dx 


Let 


(28) 


»=*(o = ^ I.- £ *<*> 

/ X iapg(Xj 2 ) dx 

We have computed the following table of values of Jpg: 


\ V 

T \ 

0 

1 

2 

3 

S 4 

0 


0 

0 

0 

0 

1 



0 

0 

0 

2 


2p$(«+« 


0 

0 

3 


-3#'*’ - 



0 


Next, we find, from (25)-(28), 


(92) 


/oo(0) 

= $; 


UaiO) 

= /p. 5 _i for g ^ 1; 


fvM 

m — 1)* V7-1 

4 


Cm 


04—1 ,lj2 

^ 3 i P.B+1 

Cm 

3(04 - 1)’'^, ,3 

8 

3(04 — 1)*^^ J_ 

g 2 1 p,,+l -l- 


\»n 


».«+2 
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Now we can expand 

« I- /<b(M 

j j wiz,y) dydx = £ wix,y) dy dx = /oo(X) 

Write the Taylor’s series for /«i(X): 

/oo(^) = /oo(o) + /o'o(o)x + X* 4- X’ -f- • ■. 

Substituting from (29), we get 

L C' "<«> ■'I' * “ * - O''*"’ 


(30) + ^ I 1 

+ (3!'(-a;*“> - pV) - 3*'’(-3p*“> - pV‘>) 

+ + • • • 

Further, we must obtain the beginning terms of y{x, y) as given in Lenuna 3, 
for which purpose we refer to Hsu's paper. We have, in fact 

6n‘/^^V24 36/n^Vl20 72 ^ 216^*/* ^ 

where 

- + 3 *1 + 3 2^ w! + i! 

04—1 — 1)*'* 


= («4 - S)t\ + 4 + . . . 

U, = E(^ih - IQE (h + <^) 

= (as — 10as)/S + • . . 


To avoid the exhibition of very long expressions, let us separate the terms 
in ypiih, iU) according to the powers of and denote the terms of the power 

“l/S “1 - j ? j « * 

n n ,n by , i('a, respectively. 

Thus and the corresponding 7 ( 1 , y) is 
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(31) 


Ti {x,y) ~ + 3-\/cn — 1 Wn{x,y) 


+ 3 ^ wii(x,y) + 

a< — 1 / 

where, as hereafter, the terms omitted will yield nothing in the long run, 
Now we have by (31) and (26), 

po(K) 

/ / yi(x, y) dy dx 


“ “ ^«8/o 3(X) + 3\/a4 - l/i4(a:, y) + 3 y) + • - 

= (os/mO) + 3 VST^MO) + 3 - ^1^^ /n(0) + • • ■) 

(“3/m(0) + + 3 + .. .^ 

“ & ("3/oa(0) + 3 V^;rn'/«(0) + 3 ^ ~ - ^^ /n(0) + • • •) 
(32) = ” 0^2 ("a-^os) ~ ” 12n “I" 3 Vai — 1 lli) 

+ ^ f («8iSa + 3 v^r^ii. + 3 ii^ 

- 2 '® ^a, iSi + 3 Voi - llli + 3 i22^| H- 

= - + 3 

+ f' + 6 + 6 

- 2 '“ [a3($«’ + p"$«’) + 6 + 6 + .... 

Similarly, omitting the intermediate steps to save space, we have 

/ ** rpCM 1 

„ jL„ 

- T^’ f 


+ 2ci(jp4''** + 12a3\/c4 — 1$® 


+ •••. 
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fee -ff(X) 

/ yi(3!, y) dy dx 

BO *^~ao 

_ JL / «5 - 10^8 ^( 4 ) , ^ «) , ^ (sA 

n^i2\ 120 ® ^72 ^ 216 / ^ • 


Combining (30), (32)-(34) and simplifying, we obtain, as the first four terms 
of the asymptotic expansion of F{z)i 

j j (wiz, y) + 'rix, y)) dy dx = ^ - + 4>®) 


v-LMSt' 


1 


4n I 6 


+ z>(d + 2(^4 - 1) - a, ^(j) _ at - 1 

\ 3 2 2 / 

+ »'■ #»+ f i”’)} 


(36) + - i_ ^ ^(81 I ga ^(61 

^ ^ ^ 2471^/2 \ 5 ^3 9 

+ z' r ~ ~ 9Q^a«4 ^(i) Taaiot — 1 — a;) — 2at 

L 2 2 

, ftaCaa — 5at + 7) (6) _ ^ (?) 

2 3 . 

_l_ g /2 |~ 9Q:3a4 + 3 o! 3 — 6at ^( 2 ) ai{3al — 7at + 7) J 
+ z'’ |^ -3a3(^ - 1) ^(j) _ ^ ^C6) J| ^ ... _ 

10. In order to estimate the remainder in the Taylor expansion 

we write, in accordance with Lemma 3, 

£ « i*0(X) 

/ {w{x, y) + y{x, y)) dy dx 

ao 00 

/ * ^oc^) r A—3 

« i .0 ^ X''2(-l)'*'^’'’a,j^,w;,j,,(x, y) \ dy dx 

= /oo(X) + 2 X'2(-l)'‘+'*o.,,J,,,.(X) = i:/^5>(0) -! 

y~l ,-0 J ! 

V A“2 h —3 

+ /o'r’( 0 X) + Z X'S(-l)'‘+'» o,,.. 

J \ '* J'fj) //\\ ^ I 2 —J»)/an\ a I, 


•|z /J;i,(0)^+/'j7r''(ex)-^ 


(fc _ 2 - >;)! 
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- BOT + AiX‘“’ (/o"’(«X) + E/!"“'’(#X)^ . 

Thus 

C{z') = A*(/r“’(9X) + 

Now we may write 

= f S(n(ff‘*\ex))')u)„(x, ! 7 (ex)) dx 

J—ao 

where, if we attach a weight s to g^‘\ffK), the polynomial under the integral 
sign is isobarifi of weight fc — 2 — k in these g^’^’s, and the coefficient of each term 
is a constant multiple of a certain Wpq{x, g{d\)). Further, it is easily seen by 
induction that we have 

9 ^‘\ 6 \) = Pi 4*.(2)(1 + e’xV)-*- 

where Pi+j* ( 2 ) is a polynomial of the three variables z, x, B\ which is of at most 
the (1 + 2s)th degree in z and of the (21 — l)st degree in x, and whose coef¬ 
ficients are all Aj. 

Therefore, 

I/m»/ I ^ f Q*(l a: I + X* + •' • + I a: ]*' ^) 

■(l + \z g(d\)) dx 

= £ <2*(l x I x^ -b ... -b I X 1“-^)(1 -b 1 Z dx 

S Q*(l + 1 z 

Thus 

(36) (7(2') g (3*(1 -b I z T"’) 

Lastly, an estimate of D is easy; 


— / I (w(x, y) -b y(x, y)) dy dx 

Q/H J—ec J—« 

^ f (w(x, u -b -L(x)) + I 7 (x, M -b L(x)) |) dx g Q*. 

J—00 


Collecting the results of (24), (36), (37) we obtain 

Pr{7 - L{X) g z') = B(z') + At((l + j z + (1 + sV”""), 



(38) 
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Or, more simply, 

(39) 1 Pr(y - L(X) g - B(z') j S Q*(l + \ s •, 

where the first four terms of B(z') are given by (35). 


12. To return to F{z). We see that B{z') depends on the function L{x). 
Recalling section 3 we now write for the B corresponding to , with m = 
2 ( - 1 or 21. 

Then by (4) the value of F(z) lies between 

Pr{y - Lu-i(X) g 2 '} and Pr{F - Li,(X) g 2 '}. 

From the asymptotic expansion just obtained for either of them, we see that 
the absolute value of their difference does not exceed 

I B,Uz') - B»{z>) 1 + Q*(l + I 2 l^*)n~“*. 

But 

hu{x) = L}i-i(x) — Z%i(a4 — l)*n“'a;“ = I/s;_i(a;) ~ bux^‘ say, 

hence 


I (e) 

/ , !«'(*» 1 /) + 'y(®, v) Idj/dx 

^ Qkbii g Q*la|n-' < Q* | 2 | n““«. 

Therefore 

1 Pr{r - L«-i(X) £ z>] - Prtr - L^^iX) g 2 '! 1 S Q*n'“» 
and 80 we obtain 

(40) F{z) = J5(2') + -Afcd + I 2 

which is equivalent to (2) in the theorem stated m section 1. 

Thus the theorem will be proved if the assertions regarding the form of f{z) 
in (1) are shown to be true. 

For this purpose we denote, as before, the terms of the order nr"'^ in iU) 
and 7 (x, y) by , y, respectively. Since the term in which yields a Wp, 
with the greatest q \b VI, we have for every w„, in y, the condition ^ 3v. 

/ yr(x, y)dydx to & — 3 — v terms, in which ffq{0), /pg(0), 

te J—wt 

* ■ occur. In the integrand of /'^’~’^(0), e.g., the coefficients of 

each Wpq{x, z) are polynomials in 2 and a: of a total degree in s and x not exceeding 
that of (p'(0))‘~““", i.e., 2(fc — 3 — r). Hence the expansion of y, will give 
rise to terms of the form 

, 8 g 3»-, « + t = 2(fc - 3 - v). 

Such a term wiU sdeld a term which in turn yields the terms with 
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J* ^ s 5 -j- i ^ 3v 2(i) - 3 - »') 3(11; - 3), 

Equality holds only when y = A - 3 and g = 3(i - 3). But when v = it - 3, 
the term in question is 

/o,a(U)(0) = Jo,}uo = 

Next, we see that i contains I/j, • • •, Uy+ 2 . Since (0) is a poly¬ 
nomial of the (k~ S - y)th degree in a;, the expansion of y, will yield 
• ’i5il’ ^ But = 0ifp>fc-3-y,hencep ^ fc-3-y. Thus 
in ii, we need only tajte account of the terms (fti)'( 2 l 2 )® with p | - 3 - y. 
Nowif j < fc - 3 - y,in t/jOnlyaa, •• •, occur. Ifj ^ A - 3 - y, 
in the coefficient of a term [itiYiikf with p ^ i - 3 - y the greatest index 
of a is 


2(i “ 3 - y) + j - (A - 3 - y) = j + fc - 3 “ y ^ fc “ 1 

since i ^ y + 2. Hence in the expansion of every y only aa, • - ait,i occur. 

The proof of the theorem is completed 
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SOME IMPROVEMENTS IN SETTING LIMITS FOR THE EXPECTED 
NUMBER OF OBSERVATIONS REQUIRED BY A SEQUENTIAL 
PROBABILITY RATIO TEST 

By Abraham Wald 
Columb%a Umversily 

Summary. Upper and lower limits for the expected number n of observations 
required by a sequential probability ratio test have been derived in a previous 
publication [1]. The limits given there, however, are far apart and of little 
practical value when the expected value of a single term z in the cumulative 
sum computed at each stage of the sequential test is near zero. In this paper 
upper and lower limits for the expected value of n are derived which will, in 
general, be close to each other when the expected value of z is in the neighbor¬ 
hood of zero. These limits are expressed in terms of limits for the expected 
values of certain functions of the cumulative sum Z„ at the termination of the 
sequential test. 

In section 7 a general method is given for determining limits for the expected 
value of any function of 


1. Introduction. Let as be a random variable and let/(3:, 6) be the elementary 
probability law of x involving an unknown parameter 0. Let ffa denote the 
hypothesis that $ = do, and Hi the hypothesis that 0 = 0i, where 0o and 0i 
are given specified values. The sequential probability ratio test for testing Eo 
against Hi, as defined in [1], is given as follows; Put 


( 1 . 1 ) 


Zv 


= log 


/(a^t, gi) 
/(^i, &o) 


where x, denotes the z-th observation on a:. Two constants, a and b are chosen 
where a > 0 and b < 0. At each stage of the experiment, at the m-th trial for 
each positive integral value m, the cumulative sum 


(1-2) Zm = Zi • -p Zm 

is computed. Experimentation is continued as long ash < Zm < a. The first 
time that Zm does not he between b and a, experimentation is terminated. The 
hypothesis Hi is accepted if Zm S o, and Ho is accepted if Z,n g h. 

Let n denote the smallest value of m for which Zm does not lie between b and o. 
Then n is the number of observations required by the sequential test. The 
expected value of n is a function of the true parameter value 0 and is denoted 
by Etf(n). 

Upper and lower limits for Ef(n) have been derived in section 4 of [1]. These 
limits, however, are of little practical value when the expected value of 
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(1.3) 


z = log 


/(a;, OQ 

/(x, 0o) 


is in the neighborhood of zero, for they converge to + » and — oo, respectively, 
as the expected value of z approaches zero. It can be shown that the expected 
value of z is negative when d = 6o, and positive when 9 = 9i ^ Thus, if the 
expected value of z is a continuous function of 9, there will be a value 9' between 
do and 9i such that the expected value of z is zero when 9 = 9'. Hence, the 
limits for Et{n), as given in [1], are of no practical value when 9 is near 9' 

The purpose of this paper is to derive upper and lower limits for Esin) which 
will be, in general, close to each other when 9 is in the neighborhood of 9' Thus, 
it will generally be possible to obtain close limits for Ei{n) over the whole range 
of 9, if the limits given here are used for values in a certain small interval con¬ 
taining 9', and the limits given in [1] are used when 9 is outside this interval. 


2. Notation. We shall use the following notations throughout the paper. 
For any random variable u, the symbol E/{u) will denote the expected value of 
u when 9 is the true value of the parameter. The conditional expected value of 
u, under the restriction that some relationship R is fulfilled will be denoted 
by E)(u I R). The symbol P{R \ 9) will denote the probability that the rela¬ 
tionship R holds when 9 is true. 

The cumulative distribution function of z will be denoted by F{z, 9) when 9 
is the true value of the parameter. The moment generating function of z, 
when 9 is true, will be denoted by ip{t, 9), i e 

(21) <p{t, 9) = f e" dF{z, 9), 

J—bO 

3. Assumptions concerning the family of distribution functions F(z, 9). In 
this section we shall formulate two assumptions concerning F{z, 9) which will 
then be used to prove various lemmas and theorems Since we are interested 
in values of 9 near 9', we shall restrict the domain of 6 to a fimte closed interval 
I containing 9' in its interior. It will be understood throughout the paper that 
any statements concerning 0 refer to the domain I, even if this is not explicitly 
stated. 

Assumption 1. The moment generating function ^(i, 9) exists for any point 
t in the complex plane and any value 9, and is a continuous function of 9. 

Assumption 2. There eists a positive d such that P(e‘ > 1 + 5 ] 0) andP{e‘ < 1 
— 5 I 6) have positive lower bounds with respect to 9. 


4 . Proof that (pit, 9) is continuous in i and 9 jointly and that all moments of z 
are continuous functions of 9. ^ In this section we shall prove the following 
theorem: 


1 This follows easily from Lemma 1 in [1], p 156 

' The original proof of the author was somewhat lengthy The present proof was sug¬ 
gested by T E Harris 
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Theorem 4.1. It follows from Assumption 1 that ip(^t, 6) is continuous in t and 
6 jointly and all moments of z are continuous functions of 9. 

Pbooe: First we show that <p{t, 9) is a bounded function of i and 9 in the 
domain | i | ^ 4 , for any finite positive value U • ‘Clearly, 

(4.1) 0 S I 9) I g 2Mto, 0) + vi-U, 6)] 

for all values t for which \t \ ^ U, . The boundedness of ip{to , 0) and vi — U , 9) 
follows from Assumption 1, Hence <pil, 0) is a bounded function of 9 and t 
over any bounded i-domain. 

Let , (9mj (wi = 1, 2, • - , ad inf.) be a sequence of pairs converging to 
the pair (t', 0'). We have 

(4 2) - v{i', S') = l<p(U, 9J ~ v>(i', e„.)] + Mt', 9„,) - fl')]. 

The second expression in brackets converges to zero by continuity in 0. Thus 
the first part of Theorem 4.1 is proved if we show that 

(4.3) lim [<o(f„ , 0„) — ipit', 9,„)] = 0. 


It follows from Assumption 1 that for any given 9, tpif, 0) is an analytic func¬ 
tion with no singularities in any finite f-domain. Hence we can expand (p(f„ , 
flm) in a Taylor aeries around 4 = t', i.e. 


('tA) 


'P{tm , 9m) — ^»(t^ 


^ 1 / aV(4, Om ) 

iWfciV a4» 



t'f. 


Let r be a given positive value. Because of the boundedness of ip(4, 0) in any 
finite 4-domain, there exists a constant M such that ] ipif, 0) | < M for all 9 
and for all 4 in the domain | 4 — 4' j S r. From the Cauchy integral formula 
for an analytic function it follows that 


(4 5) 


1 

9m) 


fc! 

34^ 



From (4.4) and (4,5) we obtain 


(4.6) 


k(4.. 9m) - 9m)\SM't 

k-1 r* 


Equation (4 3) is an immediate consequence of (4.6), This proves the first 
half of Theorem 4 1. 

Let C be a circle in the complex 4-plane with finite radius and center at the 
origin. According to the Cauchy integral formula we have 


(4 7) 


-L f 

2riJa 4*+^ 


di 


1 aV(t, 9) 
fcl dt’’ 


1-0 


fc! 


F,(«‘). 


Since tp{i, 9) is continuous in 4 and 0 jointly, the integral on the left hand side of 
(4.7) is a continuous function of 9. This proves the second half of Theorem 4.1. 
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5. Some lemmas. In this section we shall prove several lemmas which will 
then be used to derive the results contained in sections 6 and 8. 

Lemma 6.1. It follows from assumptions 1 and 2 that for any givm 6 the equa¬ 
tion in t 

(5.1) t(>{t, fl) = 1 

has exactly two real roots, one of which is zero. The othei real root is different from 
zero if E^iz) 0. If Ee{z) = 0, both roots are equal to zero, i.e., zero is a double 
root of (6 1). 

This lemma is essentially the same as Lemma 2 in [2] and the proof is therefore 
omitted.' 

Let h{6) denote the non-zero root of (5 1), if Eb{z) 9^ 0. If Ej{z) = 0, we 
put h{6) = 0. 

In what follows the variable t will be restricted to real values, unless the 
contrary is explicitly stated. 

Lemma 5.2. It follows from assumptions 1 and 2 that h{6) is a continuous 
function of 6. 

Proof: It follows from assumption 2 that 

(5.2) lim tf>{t, 0) = -b 00 

<-*±« 

uniformly in 6. Hence, since by definition 

v[h{6), e] = 1 

identically in 6, h{6) must be a bounded function of d. 

Let [flm] be a sequence of parameter values which converges to 6*. From 
Theorem 4.1 it follows that 

(6.3) lim [<p(«, O - vit, e*)] = 0 

m —*00 

uniformly in t over any finite interval. Since h{6) is bounded, we obtain from 

(5.3) 

(6.4) lim lvl/i(0, - vlhidm), 5*]) = 0. 

m—*v» 

Since ip[hid„), e„] = 1, it follows from (5 4) that 

lim vlhidm), 0*] = 1. 

m-*oQ 

It follows from assumption 1 that for any limit point h of the bounded se¬ 
quence {/i(6m)) (m = 1, 2, • •ad inf.) we have 


® Condition IV of Lemma 2 in [2] la not postulated here, since the validity of this con¬ 
dition is implied by assumption, 1 Condition IV could have been omitted also in [2], 
since it follows from condition III. 
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( 6 . 6 ) viK 0*) = 1 

If h{e*) = 0, then equation ipit, 6 *) = 1 has the only root t = 0. Conse¬ 
quently, all limit points of {h{6m )) must be equal to zero, that is 

(6.6) lim h(6m) =0 if h(6*) == 0- 

tn—too 

Now let us assume that h{d*) 9 ^ 0. Since the second derivative of <p{t, e) 
with respect to t is positive, it can be seen that ^(t, 0) < 1 for values t in the open 
interval (0, hi 6 )), and^(<, fl) > 1 for any i outside the closed interval [0, hid)]. 
Hence, (p(t, 0 ) < 1 implies that | h{ 0 ) | > | < | and h{d) and i have the same 
sign. Now let < 0 , be a value in the open interval (0, h{ 6 *)). Then we have 

( 5 . 7 ) ^(U ,e*)<i 
It follows from assumption 1 that 

(5.8) ,e„)<i 

for sufficiently large m. Hence h(0m) and to have the same sign and 

(5.9) I hie..) 1 > I to I 

Inequality (5.9) implies that zero cannot be a limit point of the sequence 
(/i(0m)). Since <p(t, 6*) = 1 has only the roots t = 0 and t = h(e*), it follows 
from (6.5) that the sequence {h{6n)} cannot have a limit point different from 
hie*). Thus, 

(540) lim hie„) = hie*) 

and Lemma 6.2 is proved. 

Lemma 5.3. It follows from assumption 1 that for any given t, Efie^'^^) is 
a bounded function of 6 . 

Proof: We have 

(5.11) Eiie'*’') ^ Esie'‘ + e"") = v»(t, 8) + ^(-t, 0) 

It follows from assumption 1 that vit, 8) and w(—t, 0) are bounded functions 
of e. Hence Lemma 5.3 is proved. 

Lemma 6.4. Let e' he a value of e such that Ei'iz) — 0, but Eeiz) 9 ^ 0 for all 
6 9 ^ e' in an open interval containing 8 '. It follows from assumptions 1 and 2 
that 


(642) 

Proof: We have 



(5.13) = 1-1- hie)z -H z' + 

2 6 


where 0 g w g 1, Hence 
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(5.14) = 1 + h{e)EB(2) + Ea{z^) + Ee(ze'‘''^'^'). 

2 o 

Since Ee(e‘^‘^‘) = 1, we obtain from (5,14) 

(5.16) h{e)Eaiz) + Ee(z^) + E,( 2 * e“*'’’‘) = 0. 

2 D 

We shall consider only values d for which h(6) ^ 0. For such values of 6, 
also Ee{z) ^ 0. Dividing (5.16) by h{9)Ei{z), we obtain 

^ + 2 ^) ^ 

Let io be an upper boimd of | ii(fl) 1 with respect to 9. Then for a suitably 
chosen constant C we have 

(5.17) I I < Ce""'' 

From this and Lemma 5.3 it follows that ^((gV***'') is a bounded function 
of B. 

Because of the continuity of h{d) we have 


(5.18) lim h{6) = 0. 

Lemma 5.4 follows from (5.16), (5.18), the boundedness of J5/8(«V*'*’‘) and 
the fact that Ej\^) is a continuous function of B and Et>{^) >0. ^ ^^ ^ 

Lemma 5.5. From assumptions! arid 2 it follows that for any given t, Ei{e ") 

exists and is a hounded function of 6, 

Proof: It is sufficient to show that Fj(e*^*) is a bounded function of 6 for 
any t, since 

(5.19) ^ e‘^" + e""'" 

Clearly, e‘^" lies between and Hence Lemma 5.5 is proved if 

we show that Ee{e‘’'‘) is a boimded function of B. 

It follows from Assumption 2 that there exists a positive integer k and a 
positive constant g such that 

(5.20) P{\zi + ■■•+ zt, \ ^ a - b\0) ^ g 

for all B, For any positive integer m and for any real values Xi < Xj we have 


(5.21) 

and 


P[(m — \)k<n^mk\B]^ 
P[(m - !)k < n\e] “ 


P[{m — !)k < n ^ mk St'ki g < Xa | g] 
P[{m — l)fc < nlS] ~ 


(m = 1 , 2 , • • •, ad inf.) 


(5.22) 


^ 1 - [1 - P(Xi 2 < Xj I fl)]*. 
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Hence 


(5.23) 


PKw — 1)& < w g wfc & Xi ^ z» < Xj I e] 
P[(m - l)fc < n g mk\8] 


^ 1 - [1 - ^(^1 g g < Xa I <?]* 

g 


Multiplying (6,23) by P[{m — 1)A: < n ^ mk | 6] and summing with respect 
to m Tve obtain 

1 - [1 - P(Xi ^ 2 <Xi !<?)]* 


(6.24) 


F(Xi g 2n < Xj 10) ^ 


9 


From (5.24) it follows readily that 


(5,25) 


P(Xi g 2„ < X; I e) 
P(Xi ^ 2 < Xj 1 0) 


is a bounded function of Xi, Xs and 6. Let A be an upper bound of the ratio 

(5.26) , Then 

(6.26) H<,(e"") ^ AE,{e'‘) = A,p{,t, d). 


Because of Assumption 1, ipij,, 6) is a bounded function of 0. Hence also 
H»(c‘'") is bounded and Lemma 5.5 is proved. 


6. The limiting value of Es(ji) when 0 approaches a value 6' for which 
Ev(?) == 0. In this section we shall prove the following theorem: 

Thbobbm 6.1. Let 6' he a value of 0 such that Et'{z) = 0, hut Eeiz) 5^ 0 for 
all 6 9^ 0’ in an open interval containing O', If assumptions 1 and 2 hold, we have 

(6.1) Im ^Eein) - j = 0* 

Paooi'; Consider the Taylor expansion 

(6.2) e*'*’*" = 1 + h(e)Z,, + 

2 D 

where 0 ^ X ^ 1. It was shown in [2] (p, 286) that 

(6.3) = 1. 

Hence, taking expected values on both sides of (6.2), we obtain 

(6.4) h(fi)E,{Z„) + E,(Z\) + E,iZ\ = 0. 

We consider only values of 0 for which Ei(z) 0. For such values, also 
h{d) 0. Thus, we can divide both sides of (6.4) by h{0)Et{z), We then 
obtain 
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(6.5) 


^e{Zn) h{d) 


+ 


Eb(z) ' 2E,(z) 

It was shown in [1] (p. 142) that 


^EeZl + ^ = 0 . 


(6.6) Ecin) = . 

Eeiz) 

Hence 

(6.7) Ee{n) + ^Ee{Z],) + ^ = 0. 

Let to be an upper hound of | h{ 6 ) 1. Then for a properly chosen constant C 
we have 


(6.8) 1 g 

From this and Lemma 5.5 it follows that EeiZ^e^''^^^ is a bounded function 
of 6 . Since lim h{ 6 ) = 0 and Es(Z\) has a positive lower bound, Theorem 

6.1 follows from 6.7, Lemma 5.4 and Theorem 4.1. 

If lim EeZX = Eb'ZI. , Theorem 6.1 gives^ 

l->S' 

m n\ I? t^\ _ ^?9'(^n) 

(6.9) a.(n) = . 

Limits for ^^/(n) can be obtained by computing limits for Eb{Z\). In the 
next section We shall give a general method for obtaining limits for E 0 [f{Z„)], 
where ^(Z„) is any function of . 


7. Determination of lower and upper limits for the expected value of any 
function of Z„. Let ^(Z„) be a function of Z„. Limits for Es[\f/{Zn)] may be 
determined as follows: First we determine limits for F7j[i/'(Zn) [ ^ o]. Let r 

be a positive variable. Clearly, for any given value r we have 

(7 1) E-eiifZj,) 1 Z„_i = a — r and Zn ^ a] = (a - r + z) | z ^ 

From (7.1) we obtain the limits 

g.l.b. Fj[^(o — r 4- 2 ) 13 ^ r] ^ Ee[\l/{Z„) \Z„t a] 

0<r<o—b 

g l.u.b. Ee[il^ia “ r + 2 ) I 2 S r]. 

0<r<a—b 

Limits for F?s[^(Z„) ] Z„ g b] can be obtained in a similar way. Again, let 
r be a positive variable. For any value of r we have 

(7.3) EeUiZ,,) 1 Z„ ^ b and Z„_i = b + r] = EelHb + r + 2 ) | z g -r 
Hence we obtain the limits 


* The validity of (6 9) was shown by the author [3] using an entirely different method. 
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g.l.b. Eg[\li{b + r + z) I 2 ^ —r] ^ Eihp(Z„) j Zn ^ 

0<r<a—6 

g l.u.b. Ee[>Pib + r + 3) 1 2 g -r]. 

D<r<a—b 

Since 

(7.5) Eg[^{Z,)] = P(Z„ ^ a)Eg[HZn) I Z„ ^ a] + PiZ„ g h)E,[i^{Z„) 1 g b], 

a lower (upper) limit for EelipiZn)] can be obtained, by replacing the condi¬ 
tional expected values on the right hand side of (7 5) by their lower (upper) 
limits given in (7,2) and (7.4). 

8. Limits for Eg{n) when h{d) is near but unequal to zero. Let 6' be a value 
of 6 for which h{6') = 0. In this section we shall derive limits for Eein) which 
will generally be close to each other for values 0 in a small neighborhood of 
Prom equation (6.7) we obtain 

(8.1) Eg{n) = ^EeZl + ^ Eg{Z\ 

where 0 ^ X g 1 Thus, limits for Ee(n) can be obtained by deriving limits 
for EgZ^n and Ee{Z\^''''’^ . Limits for EsZ^„ can be obtained by using the 

method described in section 7. 

If 6 is near 6', any crude limits for Eo{Z\i'’'''‘^ *") will serve the purpose, since, 
as has been shown in section 6, is bounded and lim h{B) = 0. 

Limits for Eg{Z \^''^^^can be obtained as follows: For simplicity, let us 
assume that hid) > 0. Then 

(8.2) Zl ^ ^ (^(0) > 0) 

Thus, to determine limits for Eg{Z\^''^’^ it is sufficient to determine a lower 
limit for Eg{Z\) and an upper limit for F8(Z^e*'®’ ^"). The latter limits may be 
derived by using the method given in section 7. 

If h{6) < 0, we have 

(8 3) Zl i S 

and a similar procedure will yield the desired limits for Fo(Z’„e^*^®^ ^"). 

It should be emphasized that the limits of Fo(n), as given in this section, 
can be expected to be close only if h{d) is near zero. For values of 0 for which 
h{d) IS not near zero, the limits of Fj(n) given in [1] can be used. 
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THE EFFICIENCY OF THE MEAN MOVING RANGE 
By Paul G. Hoel 
University of California at Los Angeles 

Summary. In. studying the variation of a variable subject to erratic trend 
effects, it is customary to employ as a measure of variation a statistic that 
eliminates most of such effects It is shown in this paper that the statistic 
w = 53" ^ I *»+i ~ 1 ’s/vj'i'in — 1) is nearly as efficient as the statistic 

5 = 53i (®t+i ~ XiY/{n — 1) that is customarily employed. The asymptotic 

variance of w is obtained by integration techniques, the proof of the asymptotic 
normality of w is based upon a theorem of S. Bernsbein on the asymptotic dis¬ 
tribution of sums of dependent variables. The method of proof is sufficiently 
general to prove the asymptotic normality of w, and of S*, for x having a dis¬ 
tribution for which the third absolute moment exists. 

1. Introduction. Let xi, X 2 , • ■ ■,Xn denote a random sample of size n from 
a population with a continuous distribution function f(x ). If a measure of the 
variability of x is desired, it is customary to select the familiar statistic 

53 (at. — 

J _ *=i 


or its positive square root s, as an estimate of the corresponding theoretical 
measure of variability. 

If, however, it is known that the variable x is subject to trend effects and that 
f{x) represents the distribution of x without such effects, then 5 ““ will not serve 
as a satisfactory measure of variability about the trend. In order to eliminate 
the influence of trends, it is helpful to employ statistics that capitalize on the 
time order relationships of the observations. There are several statistics of 
this type available, although most of them make no pretense of completely 
eliminating trend effects, even if the trend is linear. 

Perhaps the best known among statistics of the desired type is the mean 
square successive difference, 

( 2 ) -=- 

71—1 

This measure of variation has been studied extensively in recent years. Among 
the results of these investigations is a determination [1] of the efficiency of 
572 as an estimate of / for a normally distributed variable when no trend exists. 

A closely related measure of variation that is not so well known is the mean 
moving range of successive pairs of observations, 
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n—I 

L I a:;+i - Xi I 

1-1 


Although w appears [1] to have been used by baUisticians, very little seems to 
be known concerning the relative merits of and w. Since w is considerably 
easier to calculate than it would be preferred to 5^ for applications in which 
computational advantages are important. However, one would hardly allow 
such advantages to dominate a choice unless 5* and w were about equally efficient 
as estimates of variation. 

The purpose of this paper is to determine the efficiency of w and to study 
efficiency properties of generalizations of w. 

2. Definition of efficiency. The definition that will be used in tins paper 
[2] may be stated in the following manner. Let 6 be a parameter, or a function 
of parameters, of the distribution function f{x). Let T be a statistic for which 
there exists a number ^ such that 

t = \/n (T — 6) 

is asymptotically normally distributed with zero mean and variance Let 
T' be any other statistic for which there exists a number ix' such that 

i' = Vn (r - d) 

is asymptotically normally distributed with zero mean and variance Then 
T is said to be an efficient estimate of 0 provided that n < ix' for all possible 
choices of T', and the efficiency of any particular T' is defined to be 



In order to determine the efficiency of a statistic, it is therefore necessary 
to first demonstrate its asymptotic normal distribution and then calculate its 
asymptotic variance. This order of procedure will be reversed in the following 
determination of the efficiency of w. 

3. Variance of w. Let x be normally distributed with zero mean and unit 
variance. Then the mean of w, where w is given by (3), may be evaluated as 
follows: 

Ei-w) = E\xi- xt\ 

= ^ / J 1 xi - xr I dxidxi 

^ ^ L ~ dxi + j (xi — dxj 
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= h£ -2 dx, + j dx ,, 

If integration by parts is performed on the first integral with 

u = f dxi and dv = X 2 dxa, 

the uv term will vanish at both linuts and E{w) will reduce to 

( 6 ) EM =- r e-^ldxo = 

This result could have been obtained more easily by other methods, but some 
of the integrals involved will be needed later. 

For the purpose of computing the second moment of w, it is convenient 
to separate the independent and dependent product terms of w®. Since there 
are 2(n — 2) of the latter, F(w^) may be expressed in the form 

(n — lySM) = (n — 1)E I xa — *1 f 4- 2{n — 2)jB j xj — xi jj xa — Xs j 

+ (n - 2)(n - 3)F' | Xa - Xi |. 
But 

1 X 2 - Xi I* = E(x 2 - Xif = Eixl) + Eixl) = 2. 
Consequently, because of (5), 

(n — l)*F(w^) = 2(n — 2)E 1 Xa — Xi 11 Xa — Xa 1 + 2(n ~ 1) 

( 6 ) 

+ 4(n - 2 )(ti - 3)/ir. 

Now consider the evaluation of the product term 

E 1 Xa — Xi 11 Xa — Xa 1 = (2ir)~^ J J j" 1 Xj — Xi H X 3 — Xj le~*'’'i'''*a+‘'P dxidxidxs- 

By means of the expressions that were used to give (5), this triple integral may 
be reduced in the following manner: 

E 1 Xa - Xi 11 Xa - Xa 1 = (2x)^* / 1 

• 2 1 ^x 2 j dxi + J dxo cto 
~ (2x)~* J j^xa j dxi + dxt 

= 4(2,r)-» £ [x^ (^£ rfxij 

+ 2 x 26 '^*“^“^ J dxi + e~*“ J dxt. 
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These three integrals, without their constant factors, will be denoted by I 
h, and h, respectively. may be evaluated by integrating by parts with 

= 3:2 (jf and dv = xj . 

The uv term will vanish at both limits, consequently 


(7) 


“ ^ I. *’ I ' *.*.+ /] (J' «-<•!»> 


^ ’ dxij dx2 . 

T^ hrst of these two integrals may be evaluated in the same manner as the 
rst integral preceding (5). The second integral may be evaluated by makine 
the change of variable “ 


Jo 


As a result of such manipulations, 

J = _L 2t 

' 3 6 ■ 

It will be observed that h is the same as the first integral of (7) and that h 
IS available in tables; hence 


■E 1 352 — ail 11 xa — xj I 


( 8 ) 


= 4 ( 20 "' + ^ + y^'j = ^ + 

If ( 8 ) is substituted in ( 6 ), E(w^) will reduce to 


(9) 


£(w°) + 4(n-2)(n-3) 

(n - 1)21_ ,r J ^ _ 1 ^ __ 


o^w ~ ~ following desired variance 


( 10 ) 


2 

(Tw — 


(n - 1)2 




“ L° dWiibuted with mean m and 

thu ya^nce of w aa given by (10) wiU bu multipUed by a’; conaequently a = 

tot “ “ ■-‘i- ^ ™ll 


t' = y/ n(z — a) 
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possesses an asymptotic normal distribution From (10) and section 2, it there¬ 
fore follows that the asymptotic variance, that is needed to determine the 
efficiency of z is given by 




Now it is known that for x normally distributed s, as defined by (1), is an 
efficient estimate of o- with = \, consequently, because of (4), the efficiency 
of z as an estimate of a is given by 


( 11 ) 



.606. 


In [1] it was shown that for x normally distributed was an unbiased 
estimate of 0 -“ and, assuming the normality of its asymptotic distribution, 
that the efficiency of f/2 as an estimate of a was 2/3. Thus, z = W'\/ir/2 
possesses very nearly the same efficiency as a measure of variation of a normal 
variable as S^/2 does. 


5. As 3 rmptotic distribution of mean moving ranges. Although the efficiency 
obtained in the preceding section requires for its validity merely a demonstra¬ 
tion that for X normally distributed w possesses an asymptotic normal dis¬ 
tribution, it will be shown m this section that general mean moving ranges of 
a continuous variable x possess asymptotic normal distributions provided only 
that X possesses a third absolute moment. 

Let v, denote the range of the observations from a:, to a:,+i_i. Then the 
variable 

(12) W = + + + 

n — k + 1 

will represent a generalized mean movmg range, of which w will be a special 
case when k = 2. 

A proof of the asymptotic property of W can be constructed as an applica¬ 
tion of a general theorem of S. Bernstein [3]. Since his theorem is long and 
involves much explanation of notation, a simplified version of it that is sufficient 
to cover this application, and indeed many similar applications, will he given. 

Let yi,y%, ■ •, ym denote m variables for which the third absolute moments 
are bounded and let 

/Sm = yi + 3/2 -f- — + !/m . 

Then Bernstein’s theorem implies that if there exist constants Ci, c^, ci, and c« 
such that 

C\m < < CjUi, 


(a) 

and 
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(b) 

then 


yi and y, |.j are independently distributed for 

g > C 3 OT'*, C4 < 


- E{S^) 

possesses an asymptotic normal distribution with zero moan and unit variance. 

Consider the application of tliis theorem to ft = (n — fc + 1) W. The vari¬ 
ance of ft may be expressed in compact form by means of the techniques of 
section 3. Since ri is the range of k consecutive observations, it is clear that 

B(r.r.+,) = E\n) 


g > h. Furthermore, for subscripts for which it is defined, Eirin+n) will 
be independent of i. These two properties may be used to collect terms in the 
expansion of B(ft^) to give 

B(ft') = (n - A; + l)E{r\) + 2 Z - i)E{nn+,) 

t-0 

+ (n - 2fc + l)(n - 2A; + 2)E\ri). 


Consequently, 

(13) 


k~i 


in-k + DEirl) + 2'£in-k- i)Einn^i) 


-h [n(l - 2k) + {k ~ 1)(3A; - l)]ft’(rx). 


From the definition of the correlation coefficient and the fact that a correlation 
coefficient cannot exceed one, it follows that 


ftCnra+i) < Eiri)Eirti.) -f (7ri<rr,+, 

< E\n) 4 - 4. . 

If this inequality is applied to (13), 

4,< in-k+ l)ft(r?) + {k- l)(2n -3k + 2)lE\n) + + [n(l - 2k) 

+ ik ~ l)(3fc - l)]#(rx) 

< in - k + l)[ft(rj) - B'(ri)] + ik ~ l)(2a - 3A: + 2)<r4 

< [n(2A! - 1) - (A - 1)(3A: - l)]<TrJ 

< 2A:<r,J(n - k + 1). 

Thus, for a fixed k the right inequality in (a) of Bernstein’s modified theorem 
is satisfied. 

For the purpose of demonstrating that the left inequality in (a) is also satis¬ 
fied, consider the following application of Schwarz’s inequality. Let 
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(14) 


G{Xp ) ■ ■ ■ , Xh) — j ' ‘ ' J ^' " ' /(®*+J>—l) ■ ‘ ' dXh+p—i ) 


wher,e f{x) denotes the distribution function of the variable x and the range of 
integration in this and subsequent integrals is from — » to oo. Since Xp and 
/ are continuous non-negative functions, this integral is a positive function of 
the indicated variables. Then, denoting G(xp , ■ ■ •, xii) hy G, it follows from 
Schwarz’s inequality that 


fiXk)0 V 


T 


^ ^ dxi - • • dXkJ 

(15) =[/■•■/ {rifixi) ■ - • f(xk)G]^ infix,) 

< J f ^if(^i) • • • fixk)Gdxi dxkj ■■ ■ j nfixi) ■ ■ ■f(xt)G~^ 

dxi • • • dxk. 

The two integrals of this inequality will be denoted by 7a and T^, respectively. 
If the value of G given by (14) is substituted in , it will be observed that 

(16) ~ j " ' j • 

Now 7fl may be written in the form 


7^ = I ... I fixp) • • ■ fiXk)G~^^ / “ ‘ / 


dXn 




Since the x, possess the same distribution function and n is the range of the 
variables from xi to xi, the integral in brackets is equivalent to the integral 
defining G in (14); hence 

(17) 7fl = J ■■■ f fixp) • • • f[Xk)G~^Gdxp • • • dxfc = 1. 

If (16) and (17) are applied to inequality (15), they will yield the inequality 
j" • • • j" r]/(xi) • •' J{xk) dxi • • • dxij 

riTpfixi) • • • fixk+p-i) dxi • • • dxk+p-i . 

In statistical language, this inequality states that 

E\n) < Einrf), 


or, what is equivalent, that 
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( 18 ) 


E"(ri) < E{rjj). 


If (18) is applied to (13), 

al>in-k + l)E(rl) + (k- l)(2n -U + 2)E\n) + [n(l - 2k) 

+ {k- 1 )( 3 * - 1)]E\t,) 

> (71 ~ fc + l)[fi(r!) - i;=(ri)] 

> (Trlin — fc + 1). 

Thus, for a fixed k the left inequality in (a) of the theorem is also satisfied, and 
it merely remains to be shown that condition (b) is satisfied. 

For k fixed, r, and r,+j will be independently distributed provided that g >h. 
But if Ca > k, then 03(71 — k +1)°‘ > & for 0 < C 4 < ^ because 7 i — fc + 1 > 1; 
consequently n and r ,+5 will be independently distributed for {7 > 03(71 — k + 1 )‘‘, 
where 0 < Ci < ^. Thus, conditions (a) and (b) are both satisfied by R. Since 
12 = (71 — fc + 1)1F, it therefore follows that 


(19) 


W - E{W) 

aw 


possesses an asymptotic normal distribution with zero mean and unit variance 
provided only that x possesses a continuous distribution function for which the 
third absolute moment exists. The existence of the third absolute moment for 
X insures the existence of the same moment for u . 

If fc = 2, W reduces to w, and theroforcthe validity of (11) is assured. 


6. Other asymptotic distributions. The only property of the range employed 
in the proof of the preceding section was its positive nature; consequently the 
proof is applicable to moving means of other dependent statistics that are posi¬ 
tive and possess third absolute moments. 

For example, the preceding proof can be applied to 5^ to show that S* possesses 
an asymptotic normal distribution provided only that the sixth moment of x 
exists. In the study [1] of the efficiency of for x normally distributed, no proof 
was given of its asymptotic property. The preceding proof could be used in 
studying the efficiency of 6^, or obvious generalizations of it, as measure of 
variation for non-normal populations. The normality of the asymptotic dis¬ 
tribution of the serial correlation coefficient could also be verified by means 
of this proof. 
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CONFroENCE LIMITS FOR THE FRACTION OF A NORMAL 
POPULATION WHICH LIES BETWEEN TWO GIVEN LIMITS^ 


By J. WoiiFowiTz 
Columbia University 

Sununaiy. Let n and a be tbe unknown mean and variance, respectively, 
of a normally distributed population on which N independent observations 
xi, • • •, Sjf have been made. Let Li and Lj, Li < Lj, and a, 0 < a < 1, be 
given constants. We define the following symbols: 

(a) 7 - (VS,)-' /2 expj- 1 dy 

(b) X = l\r‘2x. 

(c) s^={N - l)-‘2(a:. - 

(d) xi-a as that number for whidh P{x < Xi-a} = 1 — a where x* has JV — 1 
degrees of freedom. 



It is proved that, under restrictions stated precisely below, and before the 
observations are made, the probability that D < 7 differs from a by a number 
which can be made arbitrarily small by making N sufficiently large. Thus an 
approximate (large sample) lower confidence limit for 7 is obtained. Similar 
methods can be applied to obtain upper and two-sided confidence limits. 

A problem raised by the present paper (but not attacked here) is to investi¬ 
gate the rapidity of approach to a of F{D < 7 }. It would perhaps be useful 
to obtain a series for the latter in powers of N~, the first term of such an ex¬ 
pansion is obtained here 

> Formula (5 1) of the present paper was given without proof by the author in July, 
1945, in solution of a problem put to him by Dr M A Girshiok At the time, both were 
members of the Statistical Kesearch Group, formed in the Division of War Research of 
Columbia University under contract with the National Defense Research Committee of 
the Office of Scientific Research and Development The validation of formula (5.1) in 
all rigor as it is given in the present paper was constructed by the author after he was no 
longer a member of the Statistical Research Group 

In January, 1945, Professor A Wald, then a consultant to the Statistical Research 
Group, and the present author jointly submitted to the Group an unpublished memorandum 
((fl410) entitled “Acceptance Regions Which Involve the Normal Distribution and Large 
Sample Sizes ” While this memorandum dealt with a different problem, its ideas were 
logically antecedent to formula (5 1) The present author wishes to express his indebted¬ 
ness to this memorandum and to his colleague Professor Wald. 
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1. The problem. Let /j and be the unknown mean and variance, respec¬ 
tively, of a normally distributed population on which the N independent ob¬ 
servations xi, X 2 , ■ • •, a:* have been made. Let Li and be given constants 
with Li < Li- We then have that 



is the fraction of the normal population which lies between Li and L2 . The 
problem considered in this paper is to construct a lower confidence limit for the 
unknown y, when N is large. An upper confidence limit or two-sided confidence 
limits may be constructed in a manner very similar to that described in the 
present paper. Since the construction of a lower limit is the problem which 
occurs most often in practice the discussion will be centered on it. 

A lower (confidence) limit on 7 with confidence coefficient a is a function 
D{xi , X/f) of the observations a;i, ■ ■ - , Xy with the property that, before 
the observations are made, the probability is a that D(xi, • ■ Xy) < 7. In 
any specific application it is unknown whether this last inequality holds, because 
7 is unknown. However, one who proceeds as if this inequality were true is 
using a procedure which will give correct results 100«% of the time in the long run. 

When either Li = — 00 or Lj = “ the solution, by use of the non-central t 

distribution, is well known. For a description of the procedure and necessary 
tables the reader is referred to [1]. 

2 . Acceptance regions. Let 70 be any value of the parameter 7. To 70 
there correspond infinitely many couples (m, a-) with the property that the 
normal distributions characterized by these couples all have a fraction 70 lying 
between Li and Lj ; we may write this symbolically by saying that the couples 
(m, (t) satisfy 

(2-1) 7(m, <r) = 70 . 

The construction of confidence regions is equivalent to the construction, for 
every 70, of an acceptance region Z?(7o) in theiV-dimensional Euclidean space, 
with the property that every normal distribution whose parameters n and <r 
satisfy ( 2 . 1 ) assigns to Riyn) the constant probability a. While this property 
of similarity (of. [2]) is sufficient for the construction of confidence regions, 
additional properties of the acceptance regions R{yf) are needed in order that 
the confidence region be an interval or that the upper confidence limit be always 
one (i.e,, that the confidence limits turn out to be a lower limit only), or to insure 
other features deemed desirable. 

It is easy to construct acceptance regions whioh will fulfill the condition of 
similarity. As an example, consider the case JV = 3 for convenience. Let bi, 
bj, be a number triple such that bi -f b, -j- b, = 0 . Let R{ya), for any given 
70,0 < 7o < 1, consist of all the points x\ ,xt, whioh are such that the absolute 
value of the angle ^(—ir < v) between the vector (bi, b2, bs) and the vector 
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(xi ~ X, X2 — X, Xa — x) does not lie between traya and tt + fl-Q:(7o — 1). (We 

define, in general, 2 = Nx. The points (aii, Xa, xa) for which xi = Xa = xa 

may be disregarded, since their probability is zero when the distribution is con¬ 
tinuous.) One readily verifies that the probability of R{ya) for any 70 ie a, 
no matter what n and <r are, and hence this is true in particular for the pairs 
which satisfy (2.1). 

The above method of constructing acceptance regions yields confidence regions 
which, while they cover the unknown 7 with confidence coefiicient ct, are not 
very meaningful otherwise. The fact that the probability of R{yo) is a whether 
or not (/X, a) satisfies (2.1) is already indicative of their lack of discrimination. 
Since X and s (where s is defined by 

ns^ = (x, — xf 

1 

and n = JV — 1) are sufficient estimates of 4 and cr, which in turn determine 7, 
it is clear that desirable confidence regions should be functions only of S and s. 
Consequently our first task.must be to construct the acceptance regions i2(7o) 
in the x, s plane. In the present paper we construct in the x, s plane regions 
R(yo) which have the property that their probability, under any normal dis¬ 
tribution whose parameters satisfy (2.1), differs from the prescribed a by a quan¬ 
tity which is bounded m absolute value for all 70, in such a way that the bound 
approaches zero as W increases. Thus when the sample number is sizeable we 
can obtain confidence regions for 7 which correspond to a confidence coefficient 
which differs little from a. Finally, the acceptance regions E(yo) which we 
shall construct will be such that the confidence region will be always an interval, 
and the upper limit will always be 1, i e., we will construct a lower confidence 
limit for 7. 

3 , Construction of regions Riya) in the x, s plane. First we describe two 
assumptions which we shall make. It is believed that these are reasonable 
from the practical standpoint and are satisfied in most actual investigations 
where the present problem arises. Mathematically their purpose is to enable 
us to secure a uniform bound on the difference between a and the probability 
of R{yo) (for all 70) under all couples (/i, v) which satisfy (2.1). 

Assumption 1 : There exists a positive d such that 

Li -f- d <1 p < A 2 — d. 

In most practical cases where the present problem will occur 7 will be larger 
than 1. If the latter is the case and n were very near either Li or La, then o- 
would have to be very small. In that case other methods would have to be 
used in the solution of the practical problem. The present paper deals with 
the situation, unfortunately only too c omm on in practice, where a is not too 
small. Assumption 1 puts a lower bound on a for any given value 70. (The 
bound is a function of 70). 
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AasTTMPTioN 2: The standard deviation a is less than a positive number C. 

In most practical problems such an upper bound can reasonably be set. 
Naturally, tbe larger d and the smaller C the more a priori information is at 
our disposal, the closer are our approximations and the narrower our limits. 
The effect of Assumptions 1 and 2 is to place a lower limit G on 7 where 

G = 7 (I'i + d,C)= y{L, - d, C). 

Let 7 o be any positive number such that G < 70 < 1. For an x such that 
Li < X < La, let r(x, 70 ) be the positive number such that 

y{S, r{x, 7o)) = 70. 

We define xi-a to be that number for which 

P{x < Xx-«) = 1 — 0!, 

where x has n degrees of' freedom and P is the probability of the relation in 
parentheses. The number xi-« may be found in tables of the x^-distnbution 
ifjthe value of a is one of those in common use. Finally define 

To) = r(x, 70 ) 

The acceptance regions i2(7o), G < 70 < 1, which we shall employ, are defined 
as follows for any 70 , G < 70 < 1 : 

Li, X ^ Li 
s > <pix, 7o). 



4. Proof that P{itl( 7 o)} ~ a. This section will be devoted to a proof of the 
following: 

Theorem. Let f?( 7 o) be as defined in Section 3 for G < 70 < 1 . Let the assump- 
tions 1 and 2 of Section 3 be fulfilled. Then the absolute value of the difference 
between a. and the probability of R{yo) under any couple {a, <r) which satisfies (2.1) 
is less than any arbitrarily small positive e when N is sufficiently large, i.e., when 
N is suffidenUy large, 

lP{P(7o)} -«1 < 6 


uniformly for all (p, 0 ) which satisfy (2,1) with G < 70 < 1, and which fulfill 
Assumptions 1 and 2. 


Lemma 1. 


^r(x, 70) 
flx 


exists in the open interval Li < x < Li. 


Proof: We have 


_ 1 

V 27rr(x, 7o) 



1 



dy. 
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Differentiating with respect to x we obtain, since r > 0, 


e 


,-«2/2 




with 

Hence 

(4.1) 


R = 


Li 


T = 


Li — X 


dr _ / \ 

dx Vfie-®’/* - Te-^^iy ‘ 


Since R > 0 and T < 0 within the open interval Li < x < Li, it follows that 
dr . . 

— exists in the entire open interval. 

Lemma 2. In the open interval Li < x < Li , 

dP {s > <p(x, 7 o)} 
dx 

exists. 

Proof: We have, with k a suitable constant. 


P = k f dy^kT 

■s/nvif "(ft*. 


Hence 

(4.2) 


dP ^ — fcxi-. 
dx 


Yo)/»)xi-o 




^£ C -^ T ‘“ p (=^ r )- 


Lemma 3. Let S he any arhiira'nly small positive number. The Junction 

oj X and 'Vo is hounded for Li + d < x < Li — d, G < jo < t. 

Proof: Prom (4,1) we have 


dx 


dr 

dx 


< 


e 


,- B =/2 


+ e 


—T^Ji 


Re-RVi _ ye-Ti/2 

Therefore from (4.2) we have that 


f 1 —l\ ( T ^ \ 

< max. I - , = max. 1 =-,-=r- < -. 

- \R' T ) \Li - x’ X - Lj S 

is less than a constant multiplied by 


dx 




and is therefore bounded. 


Proof of the theorem: From Lemma 3 and the Theorem of the Mean it 
follows that, in the closed interval 

L, + l<x<Li-^, 

the function P{s > tp{x, 70 )} is uniformly continuous in x uniformly for ail 
in, a) which satisfy (2.1) with 0 < 70 < 1. Hence for every positive €1 there 
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exists a positive v < t: such that \ li — h \ < v, 

It 

ii + 2 — J ^ is ~ ^ J 


implies 


I -Pis > (pGi ,70)1 — P{s > ¥>(^ 2 ,70)1 1 < *1 . 

For fixed arbitrary *2 > 0 we have, whea iV is sufficiently large, 
P(|^ — ;i|<i)}>l — £ 2 , 

from Assumption 2 and the stochastic convergence of x. Now 

P (3 > ^(/i, 70 )} = “• 

Hence, when N is sufficiently large, 

I PfP( 7 (i)) — «1 < <i(l — ea) + *2 < «» + «2 • 
Since «i and £2 are arbitrarily small, this proves the desired result. 


6 . Construction of large sample confidence regions- The acceptance regions 
K(7o) whose size never differs from a by more than a uniform bound which 
approaches zero as N increases, readily yield a lower confidence limit for 7 
(within the approximation involved). The confidence region consists of all the 
7 o for which P( 7 o) oontams the observed x, s. Our acceptance regions P( 7 c) 
are so constructed that, if 71 < 72 , P ( 71 ) is entirely contained within P ( 72 ). Hence 
the confidence region is an interval, one end of which is always unity, as was 
desired. The rule for constructing the lower confidence limit D is, therefore, 
as follows: 

a) if f < Li or X > L 2 , then D ~ G 

b) if Zn < f < L 2 , then 

pCLj— 

( 6 . 1 ) B = exp{-V}dy 

where 

w = — 1 ■ ~. 

Xl-a 

(The value of D may be found in a table of the normal distribution. It is easy 
to see that s = <p{x, D), i.e., D is the smallest value of 70 for which x, s will still 
lie in P( 7 o)). 

If the statement H < 7 is made in a large number of cases, where the assump¬ 
tions are fulfilled and the sample size is large, the proportion of correct statements 
will be close to a. 
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NOTES 

This section is devoted to brief research and expository articles on methodology 
and other short items. 


ON SEQUENTIAL BINOMIAL ESTIRIATION 

By J. WoiiFowiTz 
Columbia University 

The present note, written after a reading of the very interesting paper by 
Girshick, Mosteller, and Savage [1], is for the purpose of adding a few remarks 
in the nature of a supplement. For the sake of brevity the notation and ter¬ 
minology of [1] are adopted in toto. 

Theorem 1 below generalizes Theorem 1 of [1], In Theorem 2' we formulate 
explicitly the fact which lies at the basis of the GSM method of estimation. Parts 
of the proofs of Theorems 3 and 4 of [1] are simply proofs of special cases of this 
(e.g., equation (2) of [1]). We then use this fact repeatedly in proving Theorem 
3, which states that the Girshick-Mosteller-Savage estimate is the only proper 
unbiased estimate for sequential tests defined by regions which we shall call 
doubly simple. 

A doubly simple region is defined precisely below. Intuitively we may de¬ 
scribe such a region as the one between two curves y = fi(x) and x = f 2 (y), 
where /i(x) is defined and monotonicaily non-decreasing for all non-negative 

My) is defined and monotonicaily non-decreasing for all non-negative y, 
MO) > 0, /2(0) > 0. If the two curves intersect, the region is finite, and the 
values of the functions /i and U beyond the point of intersection are of no inter¬ 
est This description is of course purely heuristic, because in actual fact only 
integral values of the variables come into play, and intersection of the curves, 
for example, is not needed to make the re^on finite. Since the question of finite 
regions is completely settled by [L], Theorem 7, only non-finite regions remain 
to be discussed, and the precise definition given below is such as to imply that 
the region is not finite. It seems to the present writer that at least many of the 
non-finite sequential tests which may be developed for meaningful statistical 
problems will require doubly simple regions. The Wald sequential binomial 
test [2] defines such a region, which also falls within the scdpe of Theorem 6 of 
[1]. It is easy to see that there exist closed regions which are doubly simple 
and do not satisfy the conditions of this theorem. 

By a ‘ ‘proper” estimate p(a) we shall mean an estimate such that 0 < p{a) < 1 
for every a. It is difficult to see how any estimate which is not proper can 
make much sense. 

Theorem >1. A sufficient condition that a region R be closed is that lim inf 

n— 

< 00 , where A (n) is the number of accessible points of index n. 
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Phooj’: The hypothesis of the theorem imphes that there exist a positive 
number H and an mcreaaing sequence of positive integers «i, nj, na, ■ • , with 
the following properties: 

a) 91(41 > 2n. (i = 1, 2, ■ ■ ■ ad inf.) 

b) A(th) < H -s/n, ■ 

For n. BUffidently large, the conditional probability of reaching the accessible 
points on a: -b 3/ = «.+i > when an accessible point on. x + y — n, has been 
reached, is < Zf < 1 by the normal approximation to the binomial distibution, 
where K is constant (and depends on H). Hence the probability of passing 
through accessible points on all members of the mix y = n, {i = 2, • • ■, i) 

approaches zero as i —♦ «>, so that the region is closed. 

Theorem 2. Let B be any region, B tts boundary, and t = (a, 6); any accessible 
point in R. Let li{a) he the number of paths from t to [x, y) = a e B. Let Q{t) 
be the conditional probability that a path, which has reached t, will reach the boun¬ 
dary B. Then 

a tD 

Theorem 2'. (Corollary to Theorem 2) 

If R is closed, then 

( 1 ) 

or tB 

Proof: Let k{t) be the number of paths in R from the origin to t. The 
probability of reaching a «B by a path which passes through t is k(S}lt{a)p''^. 
The probability of reaching t from the origin is k{t)p’’q'‘, and hence the prob¬ 
ability of reaching the boundary via t is Q{i)k{t)p''q°, From this the desired 
result follows. 

We now define a doubly simple region. The boundary of the region consists 
of the two infinite sequences of points 


(0, Og), (1, Oi), (2, On), • • • 


and 


(bo,0), (hi,l), (b,,2), 

where Oo, ffli, oj, ■ • • and bo, bi, hi, ■ • • are two infinite non-decreasing se¬ 
quences of positive integers. The accessible points of the region are all points 
which can be reached by a path from the origin which does not contain a boun¬ 
dary point. (It is to be noted that since a boundary point is, by definition, 
a point not in the region which can be reached by a path in the region, the above 
definition imphes that a doubly simple region is not finite. The reason for 
making this so has been given above.) 

Theorem 3. Let ii be a closed doubly simple region. Then p{a) is the unique 
proper unbiased estimate of p. 
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Proof: Suppose there were two proper unbiased estimates pi(a) and 
Writing m{oi) = pi{a) — we would have 

(2) 2^ m{a)k(a)p’'q’° = 0 

a €B 

with 

( 3 ) 1 m(a) I < 1 
First we prove 

Lemma 1. If oo > 1 , then m(6o, 0 ) = 0. 

Proof: Let It*(a) denote the number of paths in R from the point ( 0 , 1 ) to 
the boundary point a. For all points a « B except (bp, 0 ) we have 

( 4 ) bofc*(a) > fc(a). 

From ( 1 ), ( 2 ), ( 3 ), and ( 4 ) we have, since fc(bo, 0 ) = 1 , 

I m(bo, 0) I 5'’“ = I £ m(a)A(a)pV I 

. I atBtOpiibQ.O) I 

(o) . - ■ - 

< E k(a)pY <hT. k*(a)pY = 6«p. 

Now as p —» 0 , the left member of the inequality ( 5 ) approaches | TO(bo, 0 ) |, 
and the right member approaches zero. This proves Lemma 1 . 

Lemma 2 . For every z < oo — 1 , 'm(b, , a) = 0 . 

Proof: In view of Lemma 1 it is sufficient to prove the following: 

If 2^ < Co — 2, and if m(b,, z) = 0 for 2 = 0, 1, • • Z — 1, then m(bz , Z) 
= 0 . Let fcz+i(a) denote the number of paths in R from ( 0 , Z + 1 ) to the 
boundary point a. For any point oteB whose ordinate is > Z + 1 we have 

(6) bobi • • • bzfcz4i(oi) > kipi). 

From ( 1 ), ( 2 ), ( 3 ), and (6) we have 

(7) 1 m(bz , Z) 1 k{hz , Z)p^q^^ = 1 Sm(a)fc(a)pY 1 < Sfc(a)pV 

^ bobi * • • = bobi • • * 

where the summations take place over all boundary points whose ordinates are 
> Z + 1 . Hence 

1 m(bz , Z) 1 fc(bz , Z)g*‘^ < bpbi • ■ • bzp. 

and letting p —> 0 we obtain the desired result. 

Lemma 3 . m(bao-i, Oo — 1 ) = 0 . 

Proof: Let s be the smallest integer such that (s, Oo) is an accessible point. 
We proceed as in Lemma 2 , with (s, Op) playing the role of ( 0 , Z + 1 ), and 
eventually obtain the following inequality: 

I m (b«o-i, ao — 1) 1 A(ba„-i, Oo — l)p“'“^g''‘’«"' = | 2o m{a)k(a)p''q’ \ 
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where 2o denotes summation over all boundary points with ordinate > Oo • 
The desired result follows. 

Lemma. 4. Let h{> oo) be the smallest ordinate for which at least one boundary 
point (w*, h) exists such that m{w*, h) 9^0 (If no such h exists the theorem is proved). 
Of all such points let w he the one with the smallest abscissa. Then the point (w, h) 
is a member of the sequence 

(0, Oo) (1, a,), (2, Oj), • • • 

Proof: If the lemma is not true, then for all boundary points a with ordinate 
h, m{a.) = 0, except that m(bk , h) 0. Let 17 be that accessible point of R 
whose ordinate is h + 1 and whose absci^a e is a minimum. Let kw (a) be the 
number of paths in R from W to the boundary point a. For boundary points 
a accessible from W we have 

( 9 ) ?) q 6 i hhkwia) > k(ot). 

From (1), (2), (3), and (9) we have 

(10) I m{bK, h) 1 fc(6h , /i)pY" = I Si(m(a)fc(a)pY 1 < Sjfc(a)p‘+*(2^ 

+ bobi •. • bY'-'g’' = K*p’‘*\ 

where; 

a) Si denotes summation over all a e J3 for which y > h 

b) Sj denotes summation over all boundary points a of ordinate h + I and 
abscissa < v. 

c) K* denotes a constant. 

From this it easily follows that m{hi, ,h) — 0, in contradiction to the definition 
of h. This proves Lemma 4. 

Proof of Theorem 3: Let (w, h) be as defined in the statement of Lemma 4. 
From Lemma 4 it follows that, if any other boundary points with abscissa w 
exist, they must be members of the sequence (6o , 0), (i?i, 1), ( 62 , 2 ), • ■ ■ and 
hence their ordinates are < h. From the definition of (w, h) and from Lemma 4 
it follows that for any a « 5 whose abscissa is < la, m(a) = 0. 

Now in the proofs of Lemmas 1-4 the roles of x and y are not symmetrical. 
However, symmetry of course exists, and analogous lemmas follow. In par¬ 
ticular, the analogue to Lemma 4 has as a consequence that, since w is the 
smallest abscissa such that m(a) = 0 when abscissa of a < .w, and m(w, h) 9 ^ 0 , 
there exists a boundary point (lo, S'lch that m(w, h') 9 ^ 0 and (w, h') is a 
member of (60 , 0), (bi, 1), (bi, 2), ... Then h' < h. But this contradicts 
the definition of h and proves the theorem. 

It is easy to see that, if the boundary points of a closed region constitute 
a single “curve” instead of two "curves” as in a doubly simple region, the 
estimate p{a) will be the only proper unbiased estimate of p. 

It is interesting to consider some of the consequences of Theorem 3 for all 
unbiased estimates (not necessarily proper) for doubly simple regions. An 
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examination of the proof of Theorem 3 shows that it would go through with 
little change if equation (3) were replaced by the requirement that j m(a) j 
be bounded. We therefore obtain the followmg result; If for a doubly simple 
region there exists an unbiased estimate p(«) of p, not identically equal to p(a), 
then not only is p(a) not proper, but also, no matter how large M, there exists a 
boundary point a such that 1 p(oi) j > M. The uselessness of such an estimate 
is manifest. 

The author is of the opinion that freedom from bias is not necessarily an in¬ 
dispensable characteristic of an optimum estimate. In general there is no 
reason for requiring the first moment of the estimate rather than any other 
moment to be the unknown parameter. The justificatiofi in any particular 
case must be based on special conditions of the problem. 

The author is indebted to Mr. Howard Levene for reading the present paper 
and making valuable suggestions. 
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DIFFERENTIATION TINDER THE EXPECTATION SIGN IN THE 
FUNDAMENTAL IDENTITY OF SEQUENTIAL ANALYSIS 

By Abraham Wald 
Columbia University 

1. Introduction. Let {zb} (a = 1, 2, • , ad inf.) be a sequence of random 
variables which are independently distributed with identical distributions. 
Let a be a positive, and h a negative constant. For each positive integral value 
m, let Zm denote the sum zi -f- ■ • • + z™ . Denote by n the smallest integral! 
value for which does not lie in the open interval (b, a). For any random 
variable u, let the symbol Eiu) denote the expected value of u. The following 
identity, which plays a fundamental role in sequential analysis, has been proved 
in [1]. 

(1.1) F[e^"V(0""] = 1, 

where 

( 1 . 2 ) vH) = 

and the distribution of z is equal to the common distribution of Zi, za, • ■ •, etc. 
Identity (1.1) holds for all points t in the complex plane for which ^(i) exists 
and 1 <p{t) 1 > 1. 
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The purpose of this paper is to formulate conditions undet which we may 
differentiate (1.1) with respect to t under the expectation sign. This is of 
interest, since variohs results in sequential analysis can easUy be established 
by differentiating (1.1) under the expectation sign. For example, the formula 
for E{n) can immediately be obtained by differentiating (1.1) at t = 0. The 
derivative of e^"V(0~" at / = 0 is given by 

(1.3) ^ n = Z, - E{z)n 

where denotes the derivative of Hence, if we may differentiate (1.1) 
under the expectation sign, we obtain the basic formula 

(1.4) £?(Z„) = E{z)E{n). 

If Eiz) 7^ 0, the above equation has been used [2] to derive lower and upper 
limits for jB(n). If, however, E{z) = 0, formula (1.4) is of little value. It will 
be shown in section 3 that 

(1.5) E{n) = when E{z) = 0. 

This result is obtained, as will be seen in section 3, by differentiating identity 

(1.1) twice at i = 0. 


2. A sufficient condition for the differentiability of (1.1) under the expectation 
sign. In what follows, the parameter t in (1,1) wiU be restricted to real values, 
even if this is not stated explicitly. For any random variable u and any relation 
R, the symbol E{u\R) will denote the conditional expected value of u imder 
the restriction that B holds. In this section we shall establish the following 
theorem. 

Theorem 2,1. 1/ ^(t) exists for all real values t, identity (1.1) may he differen¬ 
tiated under the expectation sign any number of times with respect to tat any value 
t in the domain v(0 > 1. 

Proof: First we shall derive an upper bound for E{e'^" \ n = m) for any 
given integral value m. Consider the case when i > 0, Then 

(2.1) £(e'‘'"ln = m) < Eie'^'\Zn > a, n = m) (t > 0). 

Clearly, 

(2.2) Eie*^" \Z„>a,n = m, = pe") = ( e'‘ > . 


Let l{i) denote the least upper bound of the expression 


(2.3) 
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with respect to p over the interval (e <‘‘-‘>1*', i). The existence of v(t) implie® 
that lit) is finite. It follows from (2.1) and (2.2) that 

(2.4) E(e'^'‘ 1 % = m) < e“‘J(f) (i > 0) 

and, therefore, also 

(2.5) ®(e‘^") < e^‘lit) (<> 0). 

If i < 0, one can show in a similar way that 

(2 6) E(e‘^’' 1 n = m) < e'‘%i) (t < 0) 

and 


(2.7) Eie^^’') < e^%t) « < 0). 

To prove Theorem 2.1, it is sufficient to show that the following two proposi¬ 
tions hold.^ 

Proposition 2.1. All derivatives of «^’*V(0~" with respect to t exist in the 
domain <pit) > 1. 

Proposition 2.2. For any positive integral value r and for any finite interval I 
in which <pit) > 1, it is possible to find a function Z)(Z„ , n) such that 


( 2 . 8 ) 


DiZn ,n) > 




for all values tin I and 


(2 9) E[DiZn,n)] < 

Proposition 2.1 is clearly true, if all derivatives of (pit) exist. The existence 
of these derivatives follows from the existence of (pit) for all values t. 

Since ^ e " (pit) " is equal to the sum of a finite number of terms of the type 

Zn*n’‘*e^"V(i)~") Proposition 2.2 is proved if we can show that for any given 
integral values ri and rz there exists a function DrirtiZn , n) such that 

(2.10) Dr,r,iZn ,n)>\ Z;'n'^6^"V(0"” 1 

for all t in I and 


(2.11) ElDr,r^iZn , n)] < co . 

Clearly, since (pit) > 1 in J, 

(2.12) 1 1 < I Z’'f I n"*e' 

where to is an upper bound of | i | in J. Let h be a value > U - Then for a 
properly chosen constant C we have 

(2.13) 1Z;‘Ic'"'"'*' 


^See, for example, E J. McShane, Integration, Princeton University Press (1944), p. 
216, 217 and 276. 
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Hence, it follows from (2.12) and (2.13) that 
(2.14) 1 W)'" 1 < 

for all t in I. 

We put 

(2.16) I»,.r,(^„ , n) = Cn'*(e^-‘‘ + e-^-‘‘). 

We have 

(2.16) E[Dr,r,{Z„ , n)] = C i: i n = m) + B(e-^-'‘ I n = m)] 

rn-^l 

where pm denotes the probability that n — m. 

Hence, because of (2.4) and (2.6), we obtain 

(2.17) m,r,(^„, n)] < C(e“‘-f«i) + e-^‘*Z(-«,)][ 2 p„m^>] - 

= C[e‘"‘l(ii) 4- 

Since all moments of n are finite,’ Proposition 2.2 is proved. This completes 
the proof of Theorem 2.1. 


3. The expected value of n when E{£) = 0. It will be shown in this section 
that 


(3.1) 


E{n) 


EjZi) 

E{z'^) 


when E{z) = 0, 


if identity (1.1) can be differentiated twice under the expectation sign at t = 0. 
The second derivative of e*^V(f)~'' with respect to t is given by 


(3.2) 



^)T 

V(i). 


„ - r^'(on 

ww / 




where <fi'{t) denotes the first, and the second derivative of ^({). 

Since v?(0) = 1, <p'{0) = E(z) = 0 andp"(0) = E{z^), putting t = 0, expression 
(3.2) becomes 


(3.3) 


Zl - Tup"(fl) = Zl - nE{z^) 


Hence, if (1.1) may be differentiated twice rmder the expectation sign at f = 0, 
we obtain 


(3.4) E[Zl ~ n£(/)] = 0 
from which (3.1) follows. 

An approximate value of E{n) can be obtained from (3.1) by neglecting the 
excess of Zn over the boundaries. Then Zn can take only the values a and 
b. Hence 

(3.5) E{Zi) ~ o’P(Z„ > a) + b^P{Z„ < b) 
where the sign ^ denotes approxunate equality. 

•See the paper by C. Stein, "A note on cumulative auns,” in this issue of the Annals 
of Mathematical Statiahcs. 
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It was shown in [1] (equation 28) that neglecting the excess of over the 
boundaries, the approximation formula 

^ _ bh 

(3.6) P(Z.>a)^i^ 

holds, where h is the non-zero root of the equation <p{t) = 1. This formula was 
derived there under the assumption that E(z) 0. If E(z) approaches zero, 

-h 

h—>0 and the right hand member of (3.6) converges to ^ ^. 

Putting P(Z„ > a) = ^ and P(Z„ < &) = 1 — ^ ^ j, we ob¬ 

tain from (3.5) 

(3.7) £(Z‘.) ~ a’ (^j) +b'~- 
Hence* 

/rh 

(3.8) EM . 

Limits for E(n) can be obtained by denying limits for E(Zl). Let r be a 
non-negative real variable. One can verify that 

(3.9) o' g E(Zi IZ„>a) g l.u.b. E[(a - r + z)' | ? > r] 

0<r<a-b 

and 

(3.10) 6' ^ E(Zl lZ„^b) < l.u.b. E[(b + r + zfls + r< Ol- 

0<r<o-lp 

We have 

(3.11) E(Zi) = P(Z„ > a)E(ZHZ„ > a) + P(Z„ < b)E{Z\ \ Z. < 6), 

Limits for E{Z\) can be obtained by replacing the conditional expected 
values in the right hand member of (3.11) by their limits given in (3.9) and 

(3.10). 
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A NOTE ON CUMULATIVE SUMS 

By Charles Stein 


Columbia University 


Let [Z,} be a denumerable sequence of identical independent real-valued 
random variables Two constants a > 0 > b are chosen and the random variable 

n 

n defined as the smallest integer for which one of the inequalities > a 

1 

n 

< h holds. For any events Ex and Ej, P[Ei] will denote the prob- 

1 

ability of the event Ei and P{Ei\Ei\ the conditional probability of the event 
El given that Ei has occurred. 

It will be shown that there exists > 0 such that the moment generating 
function, Ee" ‘ exists for any complex number t whose real part is less than or 
equal to <o , and as an immediate consequence that n has finite moments of all 
orders. 

If d is any constant satisfying h < d < a, then, for fixed m, 


( 1 ) 




where c = | o | + j h |. We exclude the case P(Z, = 0] = 1. Then there 
exists « > 0 such that either 


= P{Z, > <) > 0 or 82 = P{Zr < -«) > 0. 


Taking, for example, the' former alternative with mi 



1 . 


( 2 ) 




81^ > 0 


where [t«] denotes the largest integer less than or equal to w. For any poitiye 
integer k, 


P^ > T k X* - U».) 


Atmi 


<P{b<^Z^<ah<^Zi<a for s — 1, l)mi 

1 1 


since n > km implies b < ^ Zi < a. 

1 

But ^Zv= ^ Zi ^nd the second sum on the right hand 

1 1 (t-llmi+l 

side is independent of all terms in the first sum. 



CUMULATIVB SUMS 


499 


Thus the distribution of Zi given 22, for s = 1, ■ • (fc - l)mi de- 

1 1 

(fc—l)mi 

pends only on ^ Z, so that 
1 


P[n>kmi] 
P[n> {k - l)i 


E Z.+ t ^<ab< t 2.<a 


<P{ E 2. <c <1 - 5? by (1) and (2). 
\ J 


Consequently, by induction on fc, 


P[n > m} < P |« > ^ mi| < (1 - . 


Let U be any positive number less than -—log (1 - SD. 

Ml 

Then 


Ee^‘^ = E V'Tin = m} 

frlpal 

ee 

<'£e^”'^‘’‘P{{k-l)mi<n<]mi} 

( 5 ) < E >(fc - l)mi) 

k-l 

< E 

ifc-i 

= E U”M - 

1 “ fli i-i 

But this is a geometric series with decreasing terms, and is consequently con¬ 
vergent. Thus for any t whose real part R{t) < , the moment generating 

function Pe"‘ exists. Since, for all positive I, m‘ < e”“" for sufficiently large 
m, n has finite moments of all orders. 
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1, A Test of Randomness in-Two Dimensions. Howard Levene, Columbia 

'University. 

A square of side JV is divided into N* unit cells, and each coll takes on the characteristics 
A or 5 with probabilities p and q ■= 1 — p respectively, independently of the other cells. 
A cell IB an “upper left corner” if it is A and the cell above and cell to the left are not A. 
Let y, be the total number of upper loft corners and let Vi , Vj , Vt be the number of simi¬ 
larly defined upper right, lower right, and lower left corners respectively. Let V = (Vi -\- 
Fz + Fa + P<)/4. It is proved that F is normally distributed in the limit with S(F) = 
p(Nq + py and ff“{F) N^pq^(i — 20p + 45p> — 27p>)/4. The conditional limit distribu¬ 
tion of F when p is estimated from the data, and the limit distribution of a related quadratic 
form are also obtained These statistics are in a sense a generalization of the run statistics 
used for testing randomness m one dimension. 


2. Asymptotic Distribution of Moments from a System of Linear Stochastic 
Difierence Equations. Herman Rubin, Cowles Commission for Research 
in Economics. 


Let -1- Vz[ = uI , (t = 1,2, • • •), bo a complete system of linear stochastic 

diftetenoe equations determining i/i, (the coordinates of j/i), t > 0, iii terms of Vh , I 0, 
and 2 a (the coordinates of zi), which are assumed to be fixed variates, and the random 
variables (the coordinates of Ui) Such a system is called a stable if for every bounded 
set of fixed variates, and E(u',ut) uniformly bounded, E(v',pt) is uniformly bounded. This 
condition is shown to be equivalent to ^ 1 h,,r \ finite, where y'l »= ^"„o r — 

+ Ji.vu'-u is the solution of the above dilTorence equation. Let Qi be an infinite 

quadratic form in and (t, r = 0,1, • • •) with coefficients depending only on i, k, 
T, and V. Such a quadratic form is called convergent if the sum of the absolute values of 
the coefficients Is finite. It is shown under fairly general conditions that the mean of a 

convergent quadratic form is asymptotically normally distributed with variance 0 




3. Conditional Expectation and Unbiased Sequential Estimation. David 
Blackwell, Howard University. 

It IS shown that l!!I/(a;a)E„y] = E(Jy) whenever E(/y) is finite, and that <r^{EaV) < 
with equality holding only if E„y = y, where E„y denotes the conditional expectation of y 
with respect to the family of chance variables s* . These results imply that whenever 
there is a sufficient statistic u and an unbiased estimate t, not a function of u only, for a 
parameter p, the function E^i, which is a function of u only, is an unbiased estimate for p 
with variance smaller than that of I, A sequeUtial unbiased estimate for a parameter is 
obtained, such that when the sequential test terminates after i observations, the estimate 
is a function of a sufficient statistic for the parameter with respect to these observations. 
A special case of this estimate is that obtained by Girshiok, Hosteller, and Savage (Anaois 
oj Math, Slat., Vol, XVII (1946), pp. 13-23) for the parameter of a binomial distribtion. 


4. A Discuqsioa of the Ehrenfest Model. Preliminary report. Mark Kag, 
Cornell University. 

A particle moves along a straight line in steps A, the duration, of each step being t. 
The probabilities that the particle at kA will move to the right or left are (1/2) (1 — k/R) 

SOO 
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and (1/2) (1 + h/R) respectively. R and k are integers and | fc | < M. C. Wang and 
G. E. Uhlenbeck in their paper On the theory of Brownian motion II {Rev Mod. Phys. Vol. 
17 (1945), pp 323-342) discuss this random walk problem and state several unsolved problems. 
In answer to some^of the questions raised the following results are obtained Let (1 — 

•(1 + z)®"*"’ = SC*' z (j an integer) then, the probability P(n, to 1 s) that a particle starting 
from 7 iA will come to mA after time t = sr is equal to 2”^®(-l)®'''"sCj/E)'ci7i’C'B+m . 
where the summation is extended over'all j such that b 1 < E Also, if R is even the prob¬ 
ability P'(n, 0 I s) that the paiticle starting from nA will come to 0 at i = sr for the first 
time IS calculated. For n = 0 this gives a solution of the so-called recurrence time problem 
first studied on simpler models by Smoluchowski. Through a limiting process in which 
7- —> 0, A —» 0, A“/2 t —> D, 1/Rt —i nA—f Xo , mA x, Sr = t, one is led to fundamental 
distributions concerning the velocity of a free Browman particle In particular, P(n, m | s) 
approaches the well-known Ornstem-Uhlenbeck distribution 

5. Sampling from Contaminated Distributions. Preliminary report. JohnW. 
Tukey, Princeton University. 

A contaminated distribution is a nearly normal distribution in which extreme observa' 
tions aie more frequent than in a normal distribution. By studying the bias and vari" 
ability of several measures of dispersion when applied to samples from particular one" 
parameter families of contaminated distributions it is shown that (i) for nearly norma^ 
distributions, the mean deviation is often better than the standard deviation; (ii) smal^ 
changes in the underlying distribution mayincrease the sampling variance of the standard 
deviation by a factor of three This suggests that, in a broad class of cases, the mean devia¬ 
tion IS safer than the standard deviation when a single dispersion is estimated from a set 
of data This conclusion need not apply in an analysis of variance situation, 

6. On the Class of Functions Defined by the Difference Equation (x + l)/(x + 1) 
= {a + bx ) f ( x ). Leo Katz, Wayne University 

The difference equation defines only three discrete functions the binomial, the Poisson 
and the Pascal functions, the first and third have one parameter {N) slightly generalized. 
It IS shown that the Pascal function with this generalization is identical with the Polya- 
Eggenburgher distribution, which is a very useful form of the Compound Poisson Law and 
has been used to explain probability situations involving contagion Areas foi all func¬ 
tions in the class are given in terras of existing tables of the incomplete 7 and /3-functions 
Observed distributions are fitted by two moments. As Carver {Handbook of Mathematical 
Statistics) pointed out, the advantages of fitting by difference equations are many, not the 
least is the fact that it is unnecessary to discriminate among the various functions in fitting 
an observed distribution. The problem of discrimination, posed by Frisch {Metron, Vol. 
10) and others, may be resolved in terms of the sampling distribution of variances for the 
Poisson function, since the three functions correspond to situations where the variance is 
less than, equal to, or greater than the mean, respectively 

7. Retention of Decimal Places in Matrix Calculations. Franklin E. Satter- 
thwaite, Aetna Life Insurance Company. (Read by title) 

The accumulation of errors in matrix calculations has been studied by the author and 
others for special types of matrices and for special methods of calculation In the present 
paper, error formulae are developed for the standard Doolittle and Waugh-Dwyer Compact 
routines. These formulae do not place any restrictions on the matrices involved and do 
not require any extra calculations or imtial approximations. Simple rules are developed 
which give for each step in the calculations the number of decimal places which must 
be retained. These rules are efficient in the sense that the retention of fewer places will, 
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except for good fortune in balancing of errora, lend to roflulte less accurate than those 
specified The rules also assist in choosing that arrangement of the calculations which 
will lead to the smallest average number of significant figures which must be retained 
for the calculation as a whole. 

8. The Efficiency of the Mean Moving Range. Paul G. Hoel, University of 
California at Los Angeles. (Read by title) 

The statistic w = I ^i+i — xt \ ■\/J/2(n - 1) is studied as an estimate of tr for a 

normal variable subject to trend effects. It is shown that the efficiency of to compares 
favorably with that of the mean square auccossivo difference, i*. The proof that is, and 
also 5*, IS asymptotically normally distributed is made to depend upon a general result 
that can be derived from a theorem of S. Bernstein on dependent variables. 


9. Some Basic Theorems for Developing Tests of Fit for the Case of the Non- 
ParametLc Probability Distribution Function. Bradford F. Kimball, 
New York State Department of Public Service. (R^ad by title) 

Given a universe with C D.F. P[JC < s] = F{x). Consider a random sample of n values 
Xi which have been ordered so that ic, < . The successive differences of the true c.d.f. 

values a,i X = Xi are denoted by ui. Thus 


«i “ fipi) 

111 = F(x,) — F(x,_i), 2 ^ i ^ n 
u„+i - I - F(x„). 

Theobbm 1. The product power moments 

for any or all different indices from !• /o n 4* 1, where the powers are real numbers greater than 
minus one, are given by 




r(n + 1) r(p + 1) r(q + D r(w + D • ■ • 
r(n +14-^4-9 + 11)4- ■■•) 


CoBOLLABY. If o range R(k, m) is defined by 


R{0, to) = F(i„), R(n + 1, w) = 1 - F(x„+i_„) 

R(k, to) = FCXt+m) — F(xi) 

where k and m are positive integers such that m < n and fc + to < n, its probability distribu¬ 
tion IS independent of k, and hence equal to that of F(»m). 

Theobbm 2. Given a teat funtion of ui 

m 


where p is a real positive number, and the sum is for w indices chosen at random on the range 
1 io » + 1. Let ? and ff* denote the mean and variance of this test function. Fstabliah a 
convention for increasing the indices included in the above sum for increasing mas n increases, 
such that [m/(n + 1)] = constant, to nearest multiple of l/(n + I). Then the asymptotic 
distribution of (F — Y)l<r for inreasing n, subject to the above condition, is the normal dis¬ 
tribution with zero mean and unit variance, except in the trivial case m = n + 1, p = 1. 
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10. Confidence Limits for the Fraction of a Normal Population which Lies 
between Two Given Limits. Jacob Wolfowitz, Columbia University, 
(Read by title) 

Let Si , Xjf be N independent observations from a normal population with mean jx 
and variance both unknown. Let Nn = 2 ®, and (N — l)s* = 2 (®i — define ® and s* 
Let Li and Li be given constants with Li < Lj, and let 

_ 

7 = (■\/2iro-)-i 1 

J Li 

By a lower confidence limit on 7 with confidence coeflficient a is meant a function D(xi , 

• • ■ , Xfr) such that the probability is a that D <y Since x and are sufficient estimates of 
p and 0-2 the restriction that Z) be a function of £ and s only is imposed It is assumed that 
there exist a) a positive d such that Li + d < p < Lj — d; b) a positive C such that a < C. 
From these it follows that there exists a lower bound (? = G(,d, C) on 7 . Let Xi-a be that 
number for which P(x“ < Xi-ol =1 — 0 :, where x* has JV — 1 degrees of freedom, and let 

w = — It is shown that if D be defined as follows: 

Xi.„ 

1 ) if Li < i < Li , 

p Lj—i/ v> 

D = (2ir)”l / exp 1-Jj/M dy 

J lii—x/tD 

2) B = G otherwise, then | P(B < 7 ! - a j approaches zero asN-* «>. Thus B is a large 
sample lower confidence limit. The extension to upper and two-sided limits presents 
no difficulty. 

11. The Consolidated Doolittle Technique, Paul Bobchan, Econometric 
Institute (Read by title) 

The quadratic matrix notation is interpreted as a segment in a sequence of matrices 
wherein each successor matrix is augmented by a bordering row and column. Extension 
theorems based on this idea date back into the last century. The step from the original 
concept to one of higher order is also fruitful in discussing inverse matrices, specifically 
the inverse of a symmetric matrix The symmetry of the matrix of normal equations for 
a set of multiple regression coefficients is restored by adding the transpose of the column 
on the right side of the equations, 1 e. the co-variances with the dependent variable and 
the variance of the dependent variable itself. The inverse of this matrix can be con¬ 
structed as partial sum over a senes of matrices. Each individual element of this senes 
18 in Itself meamngful. The solution for the se\ of multiple regression coefficients relating 
the fc-th variable to the preceding (fc - 1) variables is a column matnx. The product of 
this matnx with its transpose expressed in terms of the residual variance forms the fc-th 
term in the matrix senes. The summation of the first n products yields the inverse matrix. 
This characteristic of the inverse can be used to great advantage in the standardization 
of elementary computational steps 

12. Estimation of Structural Equations through Linear Transformation of 
Regression Coefficients. Theodore W. Anderson and Herman Rubin, 
Cowles Commission for Research in Economics. 

A method is presented for estimating the coefficients of a single structursA- equation m 
a system By', + Tz[ = u[ {i = 1,2, ■ ■■, T), where B and r are matrices of coefficients, y, 
IB a row vector of G observed jointly dependent variables, zt of K observed predetermined 


exp 




dy 
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variables and Ut of G random elements. Given the distribution of the random elements, 
the equations define the distribution of the yi . Some coordinates of 2 , may be coordinates 
of , etc. It is assumed that the structural equation to be estimated has at least 0 — 1 
coefficients prescribed zero. The part of the population regression matrix corresponding to 
the predetermined variables with zero coefficients has rank one less than the number of 
jointly dependent variables with non-zero coefficients. The maximum likelihood estimate 
of this matrix is a linear transformation of the unrestricted sample regression matrix. The 
estimated vector of coefficients of y < is the vector annihilated by this matrix. The vector of 
coefficients of zi is estimated by means of this vector and the regression matrix. Those 
estimates are consistent and asymptotically normally distributed. For Z( fixed, small 
sample confidence regions are given for the coefficients. 



NEWS AND NOTICES 


Readers are invited to submit to the Secretary of the Institute news items of interest 

Personal Items 

Dr. Armen A. Alchian, who has been discharged from the Army with the rank 
of Captain, is now an Assistant Professor m the Economics Dept at the Uni¬ 
versity of California at Los Angeles. 

Dr. Franz L. Alt is now Assistant Director of Research at the Econometric 
Institute. 

Colonel Dinsmore Alter is on terminal leave after more than four years’ 
service in the Transportation Corps of the Army, and has returned to his duties 
as Director of the Griffith Observatory in Los Angeles. During these years 
Colonel Alter traveled approximately 250,000 miles on the ocean as a Trans¬ 
port Commander, visiting each continent except the Antarctic 

Dr Theodore W. Anderson, formerly with the Cowles Commission, is now an 
Instructor in the Dept of Math. Statistics at Columbia University, and plans 
to be on a Guggenheim Fellowship beginning in June 1947. 

Mr Herbert Barkan has been appointed to an Instructorship in the Newark 
College of Engineering 

Mr. Robert E. Bechhofer, formerly a statistician with The Kellex Corpora¬ 
tion, is a graduate student at Columbia University this year. 

Mr. Stanley G. Behrends is now Cost Accountant with the California Wire 
Cloth Corporation, in Oakland. 

Messrs. Carl A. Bennett, Jack I. Northam, and Max A. Woodbury have 
all returned from various types of war service to the University of Michigan 
as graduate students in statistics. Mr, Bennett was with the Manhattan En¬ 
gineering District for over two years, first at the Metallurgical Lab., University 
of Chicago, and then at Oak Ridge, Tenn. Mr. Northam was recently dis¬ 
charged from the Army with the rank of Lieutenant, having served with the 
Signal Corps for four years in the Pacific area. Mr. Woodbury was discharged 
from the Army with the rank of Captain, having been in the Meteorology serv¬ 
ice for five years, most of which time was spent in the European theater. 

Mr. Richard Berger has received his discharge from the Navy, and is employed 
as a Research Analyst with Dun and Bradstreet. 

Dr. Archie Blake, formerly at Aberdeen Proving Ground, is now Senior 
Statistician in the Office of the Army Surgeon General 

Dr. Ernest E. Blanche, who had been teaching in one of the European Army 
University Centers, is now Principal Admimstrative Analyst in the Plans and 
Policy Office of the War Department General Staff, and is also Lecturer at 
American University. 

Mr Royal F. Bloom has resigned the position which he held for a short time 
with the Psychology Dept, of Iowa State College after his release from the 
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Navy last March, and has returned to the Navy Department as Assistant Head 
of the Classification Research Division, Bureau of Naval Personnel. 

Mr Earl K. Bowen has been appointed to an Instnictorship m statistics at 
Babson Institute of Business Administration 

Mr Albert H. Bowker is enrolled this year as a graduate student at the Uni¬ 
versity of North Carolina. 

Mr. Charles R. Brearty has joined the Technical Staff of Bell Telephone 
Laboratories, Inc. 

Mr Clyde A. Bridger is on leave from his position at the University of Utah, 
and is spending the year at the Institute of Statistics in Raleigh, North Carolina. 

Mr. Arthur W Brown, formerly with the Columbia University Division of 
War Research, is now with the Standard Oil Company of New Jersey. 

Dr George W. Brown, formerly connected ivith the RCA Laboratories at 
Princeton as Research Engineer, has accepted a position as Research Associate 
Professor in the Statistical Laboratory at Iowa State College, 

Mr. Richard H. Brown has been appointed to a Lectureship in Mathematics 
at Columbia University. 

Ur Richards. Burington, Director of the Evaluation and Analysis Groups of 
the Research and Development Division of the Bureau of Ordnance, Navy 
Department, has been named Chief Mathematician, Bureau of Ordnance. 

Mr. Roy A. Chapman, who has been Silviculturist at the liitchiti Exper¬ 
imental Forest, Round Oak, Georgia, is now with the U S. Forest Service m 
Washington, D. C. 

Dr. Way Ming Chen has been appointed to an Instructonship in mathematics 
at Brown University. 

Dr John M. Clarkson has been promoted to a professorship at North Carolina 
State College. 

Mr. S. Lee Crump has been promoted to an Assistant Professorship at Iowa 
State College. 

Dr. Joseph F. Daly, formerly an Instructor at Catholic University, and more 
recently a Lieutenant in the Navy Department, is now Statistician, with the 
Bureau of the Census. 

Dr. Daniel B. DeLury has been promoted to a professorship in statistics at 
Virginia Polytechnic Institute. 

Dr. Acheson J. Duncan has been appointed to an associate professorship of 
political economy at The Johns Hopkins University. 

Dr Jack W. Dunlap, foimerly at Rochester University and more recently a 
Lieutenant Commander in the U. S. Navy, is now Director of tho Division of 
Biomechanics of the Psychological Corporation. 

Mr. Francis B. Elmore has been discharged from the Army and is Quality 
Control Engineer at the Union Bag and Paper Company, in Savannah, Ga. 

Mr. Mark W. Eudey has returned from service to his former position with the 
Statistical Laboratory at the University of California. 
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Mr. Charles D Ferris, formerly at Aberdeen Proving Ground, is now Quality 
Control Engineer with the General Electric Company, in Bridgeport, Conn. 

Mr Lester E,. Frankel is now a statistician with Dun and Bradstreet 

Mr John E. Freund has accepted a position as assistant professor of mathe¬ 
matics at Alfred University. 

Dr Bernard Friedman has been promoted to an assistant professorship at 
New York University. 

Dr. Milton Friedman has been appointed to an associate professorship in the 
Department of Economics at the University of Chicago. 

Mr. G Rupert Gause is now with the Technical Staff of the Bell Telephone 
Laboratories 

Professor Edwin L Godfrey has been appointed Head of the Department of 
Mathematics and Astronomy at Defiance College 

Dr Casper Goffman has been appointed to an assistant professorship in the 
Department of Mathematics at the University of Kentucky 

Mr Harry H. Goode is now a Mathematician in the Office of Research and 
Inventions, U. S. Navy 

Mr. Robert D Gordon is a Teaching Assistant in Mathematics at Indiana 
University. 

Mr Bert A Gottfried has returned from the service and is Research Analyst 
with Dun and Bradstreet. 

Dr. Clyde H. Graves, formerly at Pennsylvania State College, is now Opera¬ 
tions Branch Chief of the Office of Price Board Management, OPA. 

Dr. Joseph A Greenwood has recently been separated from active duty with 
the Navy and is now a statistician in the Bureau of Aeronautics. 

Mr. Harris T Guard has returned to Colorado A. and M as an Instructor in 
the Department of Mathematics. 

Dr Joy P. Guilford has returned to his former position as Professor of Psychol¬ 
ogy at the University of Southern California 

Prof. Emil J. Gumbel, formerly with the New School of Social Research, 
has been appointed to a Special Lectureship in Statistics at Newark College of 
Engineering 

Mr. Lee S. Gunlogson has been discharged from the Navy and is now in the 
statistical department of the Lumbermens Mutual Casualty Company, Chicago 

Dr Paul R. Halmos has been appointed to an assistant professorship in mathe¬ 
matics at the University of Chicago. 

Professor Preston C. Hammer has returned to his former position at Oregon 
State College. 

Mr. Joseph 0. Harrison, Jr. is now employed as a mathematician for the 
Harvard University Automatic Sequence Controlled Calculator Project in 
Cruft Laboratory 

Mr. Millard Hastay, formerly with the Statistical Research Group at Co¬ 
lumbia University, is now Research Associate at the National Bureau of Eco¬ 
nomic Research. 
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Mr. Bernard Hecht has been promoted from Chief Quality Control Engineer 
to Manager of the Quality Control Department of the International Resistance 
Company, Philadelphia. 

Mr. Joseph L. Hodges, Jr. has been appointed to a teaching assistantship in 
mathematics at the University of California 

Dr. Paul G. Hoel has been promoted to an associate professorship in mathe¬ 
matics at the University of California at Los Angeles. 

Mr Richard A. Hornseth has been appointed to an metnictorship in the De¬ 
partment of Sociology and Anthropology at the University of Wisconsin. 

Mr. Harry M. Hughes has been appointed to a teaching assistantship at the 
University of California. 

Mr. Leonid Hurwicz, formerly with the Cowles Commission, lias been ap¬ 
pointed to an associate professorship at Iowa State College. 

Mr. Joseph B. Jeming has been separated from service with the Air Forces, 
and is now a Financial and Economic Consultant in New York City. 

Mr Paul Johner has been discharged from the Army and is now in the Indus¬ 
trial Engineering Division of the Aluminum Company of America, New Kens¬ 
ington, Pa. 

Miss Margaret Kampschaefer, who is a statistician in the War Department, is 
now serving in the Supply Division of the Air Force Service Command in Erlan¬ 
gen, Germany. 

Dr, Leo Katz has been appointed to an assistant professorship at Michigan 
State College. 

Mr. Frederick G, King has been discharged from the Army and is now a 
civilian instructor in the Anti-Aircraft Artillery School at Fort Bliss. 

Dr. Tjalling Koopmans has been appointed Associate Professor of Economics 
at the University of Chicago. 

Mr. Paul J. Kopp has been discharged from the Army and is now with the 
Patent Department of the Gulf Oil Corporation, Washington, D, C. 

Dr, Carl F. Kossack has accepted a position as mathematician with the 
Joint Army-Navy Air Intelligence in the Strategic Vulnerability Branch. 

Dr Waclaw Kozakiewicz has been promoted to an assistant professorship 
in ihathematicB at the University of Saskatchewan. 

Professor Rafael Laguardia has returned to Uruguay as Director of the In¬ 
stitute de Matematica y Estadistica, Pacultad de Ingenieria, 

Dr. Charles R. Langmuir, formerly with the Psychological Corporation, is 
now Secretary-Treasurer and Lab. Director of the Bennett and Langmuir 
Development Corporation, Mamaroneck, N. Y 

Mr. Charles M. Larson has accepted a position as mathematician with the 
Pacific Mutual Life Insurance Company, Los Angeles. 

Miss Lucy A. LaSala, formerly with the research group at Columbia Uni¬ 
versity, is now teacher of mathematics at East New York Vocational High 
School. 

Dr. Richard A. Leibler is now a Member of the Institute for Advanced Study, 
Princeton. 
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Miss Grace L Lesser, formerly with the research group at Columbia Uni¬ 
versity, is now employed as a statistician with the Econometric Institute, 

Miss Myra Levine has accepted a position as statistician with the Socony- 
Vacuum Oil Company, in New York City. 

Dr Jerome C R. Li has been appointed to an instructorship at Oregon State 
College. 

Professor William T. Martin has accepted a professorship m the Department 
of Mathematics at Massachusetts Institute of Technology. 

Miss Ethelyne L. McBee, formerly with the U. S. Department of Agriculture, 
is now teaching science and mathematics at the Falls Church High School, 
Falls Church, Virginia 

Dr. Paul W. McGann has been appointed to an assistant professorship in 
economics at American University. 

Dr Max F. Millikan has been appointed to a research associateship at Yale 
University 

Mr. Probodh C. Mittra has accepted a position as consulting statistician with 
the United Nations Economic and Social Council. 

Dr. Marjone E Moore has transferred from her position as statistician with 
the Social Security Administration, to one as Program Analyst in the Office of 
Vocational Rehabilitation, Federal Security Agency. 

Miss Judith Moss, who was with the research group at Columbia University, 
IS now research assistant with the National Bureau of Economic Research 

Dr Frederick Mosteller has been appointed to a lectureship and research 
associateship in the Department of Social Relations at Harvard University. 

Mr. James E. Myers, formerly with the Naval Research Laboratorj'- at Ana- 
costia Station, is now with the research group of the Moore School of Electrical 
Engineering, University of Pennsylvania 

Mr. Stanley W Nash is a graduate student this year at the University of 
California 

Professor J. Neyman is on leave from his position at the University of Cali¬ 
fornia for the fall semester, and is Visiting Professor of Mathematical Statistics 
at Columbia University 

Mr. Russell T. Nichols has been discharged from the Army, and is a graduate 
student at the University of Chicago 

Mr Harold Nisselson has been discharged from the Navy, and is now a statis¬ 
tician in the Bureau of the Census, where he was formerly employed. 

Professor Nilan Norris has been separated from his service with the Army, 
with the rank of Major, and has returned to his position in the Department of 
Economics at Hunter College 

Dr. Guy H. Orcutt, formerly at Massachusetts Institute of Technology, has 
accepted a research position in the Department of Applied Economics, Cam¬ 
bridge University. This new department is to be modelled somewhat along the 
lines of the Cowles Commission at the University of Chicago, and is to be under 
the direction of Dr. J. R N. Stone. 
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Mr. Warren H Page has been separated from sen vice with the Army, and is 
now a graduate student at Columbia Universtty. 

Mr. Nicholas Pastore has been appointed to an instnuitorship at Union Junior 
College, Cranford, New Jersey. 

Mr. I. B. Perrott has been demobilized from the British Army with the rank 
of Major. 

Mr George W Petrie, III hsia accepted a position iis Special I'lngincer with 
the Bethclehem Steel Company. 

Dr. Harry S. Pollard has been promoted to a profes.sorship at Miami Uni¬ 
versity. 

Dr. G. Baley Price, Profcssoi of Mathematics at the University of Kansas, 
has been awarded a Post-Service Guggenheim Fcllow.ship, beginning September 
1, 1946. 

Mr. Iloliert J Randall has been discharged from the Army and is now a 
graduate student at Columbia Univcr.sity 
Professor Lowell J. Reed, of the School of Hygiene and PuVilic Health, The 
Johns Hopkins University, has been appointed Vice-President of the University. 

Dr. Francis Regan has been promoted to a professorship at St. Jjouis Uni¬ 
versity. 

Mrs. Kathryn B. Rolfe, formerly at the University of (California at Berkeley, 
has accepted a position as associate in matheniaties at the University of Cali¬ 
fornia College of Agriculture, at Davis. 

Mr Frank Saidel is a graduate student in mathematical statistics this year at 
Columbia University. 

Dr. Leonard J. Savage has been awarded a Special Rockefeller Fellowship, 
beginning September 1946. 

Professor Henry Seheffd of the University of Ckilifornia at Los Angeles has 
been awarded a Guggenheim Fellowship, and is spending the year at the Uni¬ 
versity of California at Berkeley 

Professor Andrew S. Schultz, Jr. has been separated from service with the 
Army and has returned to Cornell University with the rank of a.ssociate pro¬ 
fessor. 

Dr Saul B. Sells, formerly with the OPA, has accepted a position as Assistant 
to the President of the A. B. Frank Company, San Antonio. 

Mr Lawrence W. Shaw is now a statistieian with the U S. Public Health 
Service in Bethesda. 

Dr. Ronald W. Shephard has been appointed to a lectureship at the Uni¬ 
versity of California, Berkeley. 

Mr Clifford R. Simms has accepted a position as manager of the Cleveland 
office of the B. E Wyatt Company. 

Mr George B Simon has been separated from Army service with the rank of 
major, and has accepted a civilian position as chief of the Analysis and Research 
Unit, Psychological Section, Office of Surgeon, Barksdale Field. 

Mr. Herbert Solomon has been appointed to an instructorship at the College 
of the City of New York. 

Mr. Melvin D. Springer has returned to the University of Illinois and has been 
appointed to an assistantship. 
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Mr. Andrew P. Stergion has been discharged from the Army, and is now Sta¬ 
tistical and Quality Control Engineer with the Corning Glass Works. 

Mr Milton S. Stevens has been discharged from the Navy, and has accepted 
a position as Director of Special Projects with Time, Inc 

Dr. George J. Stigler has been appointed to a professorship in economics at 
Brown University. 

Mr Alexander L Stott has been discharged from the Navy, and is now a staff 
assistant in the Treasury Department of the American Telephone and Tele¬ 
graph Company 

Dr. L V Toralballa has accepted a teaching position at Fordham University 

Dr. Walter R. Van Voorhis has returned to Fenn College, with the rank of 
associate professor. 

Mr. Edward H Van Winkle has been appointed to a professorship of business 
statistics at Rensselaer Polytechnic Institute 

Dr Charles W. Vickery has been appointed to an associate professorship at 
Ohio State University. 

Mr. David F Votaw, Jr. has been separated from service with the Navy and 
has returned to Princeton University as Research Associate 

Mr. W. Allen Wallis has been appointed to a professorship at the University 
of Chicago. 

Mr. Ralph E. Wareham is now managing director of the National Photocolor 
Corporation 

Dr Jacob Wolfowitz has been appointed to an associate professorship in 
mathematical statistics at Columbia University. 

Mr John F. Wyckoff, formerly at Trinity College, has accepted a position in 
the Research Division of the Actuarial Department, Connecticut General Life 
Insurance Company, Hartford. 

Mr. Earl K. Yost, Jr. has been appointed to a graduate assistantship in 
mathematics at the University of Oregon. 


A conference on applied mathematical statistics was held at Lake Junaluska, 
North Carolina, August 4-9, 1946 under the sponsorship of the Institute of 
Statistics of the University of North Carolina The following individuals at¬ 
tended the conference: C I. Bliss, W- G Cochran, Gertrude M. Cox, D. B. 
Duncan, C Eisenhart, R. A Fisher, Carl F Kossack, Frederick Hosteller, 
H. W Norton, Paul Peach, Charles F. Roos, Walter A. Shewhart, Frederick 
Stephan, Gerhard Tintner, John W. Tukey, S. S. Wilks, C. P. Winsor, and J. 
Wolfowitz 


Newark College of Engineering is sponsoring a series of conferences on In¬ 
dustrial Statistics. The first of these, on Acceptance Sampling, began on Sep¬ 
tember 27 and ran for eleven four-hour Friday sessions. Among the members 
of the Advisory Panel on Industrial Statistics are Institute members S. B. 
Littauer, A. I. Peterson, W. A Shewhart, and S. S Wilks. 
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New Members 

Thefollomng persons have been elected to membership in the InsMnte; 

Anscombe, F. J. Rothamsted Experimental Station, Ilarpenden, Horta, Eng. 

Back, Kurt W., M.A (California at L. A.) Stat, Surveillance Branch, Ballistic Rea Lab 
Aberdeen Proving Gd., Md. 

Bresnalian, Maurice F. Stnt., U. S. Bur. of Labor Statistica, Wash., D. C., Apt. SOS, WIB 
N St, N.W., Wash. 1 

Chung, Kal«Lal, M.A. (Princeton) Graduate Coll., Princeton Univ,, Princeton, N. J, 
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REPORT ON THE ITHACA MEETING OF THE INSTITUTE 

The Ninth Summer Meeting of the Institute of Mathematical Statistics was 
held at Cornell University, Ithaca, New York, on Thursday, August 21, and 
Saturday, August 23, 1946. The meeting wa.s held in conjunction with the 
summer meetings of the American Mathematical Society and the Mathematical 
Association of America. The following 71 members of the Institute attended 
the meeting: 

P. L. Alger, C. B Allencloerfor, T. W. Anderaon, Jr,, J L. BarncB, E. E. Blanche, Paul 
Boachan, A H. Bowker, A. E. Brandt, It S Burington, W. G. Cochran, E P. Colemam 
H. B. Curry, J. 11 Curtiflfl, J. L. Doob, J Dutka, P. S Dwyer, B Epstein, Will Feller, C- 
D Ferris, R. M Foster, J. E. Freund, M. A Girschick, A. A Goodman, Louis Guttinan, 
W. W, Gutzman, P R Halmoa, T. E Harria, Bertha I. Hart, E. H C. Hildebrandt, P. G. 
Heel, R. H. Hoakius, Harold Hotelling, W W. Jacobs, T. J. Jaramillo, Evan Johnaon, Jr., 
H. L. Jonea, Mark Kac, Irving Kaplanaky, Leo Kart, Tjalling Koopmans, C. F. Koasaok, 
M. M, Lavin, Walter Leighton, Jr,, Howard I.«vene, M. S Macplvail, J. W. Mauohly, P. J. 
McCarthy, E. C. Molina, Margaret E. Moore, J. E. Morton, L F. Nanni, P. M. Ncurath, 
E. G, Olds, G. B Price, C. J. Rees, Selby Robinson, Herman Rubin, P J. Rulon, Arthur 
Sard, F. E. Satterthwaite, I. E. Segal, G. R. Seth, Andrew Sobezyk, Herbert Solomon, C M, 
Stein, F. P, Stephan, A. P. Storgion, A. W. Tucker, J. W. Tukey, J L. Ullman, Abraham 
Wald,S S. Wilks. 

The first session, a joint session with the American Mathematical Society, 
was held on Thursday morning, and was devoted to contributed papers. Pro¬ 
fessor W. G. Cochran, President of the Institute, presided. The following 
seven papers were presented; 

1. A Test of Randomness in Two Dimensions. 

Mr. Howard Levono, Columbia Umveraity. 

2 . AsymjitoUc Dislribution of Moments from a System of Ijtncar Stochastic Difference 

Equations 

Mr, Herman Rubin, Cowles Commission for Research in Economics. 

3 Conditional Expectation and Unbiased Sequential Estimation. 

Professor David Blackwell, Howard TJmversity. 

4. A Discussion of the Ehrenfest Model Preliminary report. 

Professor Mark Kao, Cornell University 

6 . Sampling from Contaminated Dislnbulions. Preliminary report. 

Professor John W. Tukey, Pnneoton University. 

6 On the Class of Functions Deflnvd by the Difference Equation (» + 1) /(a: + 1) = 

(a + bx) fix). 

Dr Leo Katz, Wayne University. 

7. Retention of Decimal Places in Matrix Calculations 

Dr. Franklin E. Satterthwaite, Aetna Life Insurance Company. 

The following four papers were presented by title, 

8 . The Efficiency of the Mean Moving Range. 

Professor Paul G. Hoel, University of Califorma at Los Angeles. 

9. Some Basic Theorems for Developing Tests of Fit for the Case of the Non-Paramelric 

Probability Distribution Function. 

Mr. Bradford F, Kimball, N. Y, State Department of Public Service, New York 

City. 
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10 Confidence Limits for the Fraction of a Normal Population Which Lies Between Two 

Given Limits 

Professor Jacob Wolfowitz, Columbia University. 

11 The Consolidated Doolittle Technique 

Dr Paiil Boschan, The Econometric Institute, Inc. 

Abstracts of all these papers appear elsewhere in this issue of the Annals. 

At two o’clock on Thursday afternoon there was a joint session with the Ameri¬ 
can Mathematical Society which featured the invited address of Professor J. 
L. Doob of the University of Illinois on Probability in Function Space. This 
address was followed by a business meeting of the Institute which featured 
reports by the President, the Secretary-Treasurer, the Editor, and Professor 
Feller, who spoke for the recently created committee on the distribution of the 
Annals in the war areas. 

On Thursday evening there was a joint dinner with the American Mathemati¬ 
cal Society and the Mathematical Association of America. 

The meeting closed with a session on Friday mornmg devoted to the topic, 
Multivariate Analysis for Non-Experimental Data Professor Will Feller, of 
Cornell Universitj’’, presided. Professor T. Koopmans, of the Cowles Com¬ 
mission for Research m Economics, presented a paper entitled Statistical Infer¬ 
ence in Dynamic Economic Models. Dr T W. Anderson, Jr. presented a paper 
written by himself and Mr. Herman Rubin entitled Estimation of Structural 
Equations through Linear Transformation of Regression Coefficients. The meet¬ 
ing concluded with a discussion of these papers 

P. S. Dwyeh, 
Secretary. 



REPORT OF THE PRWCETON MEETING OF THE INSTITUTE 


The twenty-third meeting of the Institute of Mathematical Statistics was 
held in Princeton, New Jersey on Friday, November 1,1946, in connection with 
the year-long Celebration of the Bicentennial of Princeton University. The 
meeting was devoted entirely to Andy sis of Variance. The meeting was at¬ 
tended by 118 persons including the following 96 members of the Institute: 

Adam Abruzzi, Forman S. Acton, It. L. Anderson, T. W. Anderson, Jr., M. S. Bartlett, 
Robert Beohofer, Gilbert W. Beebe, J. H. Bigelow, Archie Blake, C I. Bliss, A. E, Brandt, 
Burton H. Camp, George C. Campbell, A. George Carlton, Kai Lai Chung, W G. Cochran, 
Gertrude Cox, Harold Cram6r, S. Lee Crump, J. H. Curtiss, Joseph F. Daly, Besse B. Day, 
D B DoLury, V V. Divatia, J. Dutka, Churchill Eisenhart, B. Epstoin, H. L. Panshaw, 
Nicholas Fattu, Will Feller, Merrill M, Flood, Bernard Friedman, Hilda Geiringer, H. II 
Goldstino, Joseph A, Greenwood, E. J. Gumbel, Margaret Gurney, L. Gutmann, T. E. 
Harris, Millard Hastay, Irwin S. IIolTcr, C J. Kirchen, B. F. Kimball, Lila F. Knudsen, 
H. S, Konijn, Jack Laderman, J. D Maddrill, Sophie Marcuse, H, C. Matliisen, J. W. 
Mauchly, Margaret Merrell, Elmer B Mo<le, Margaret E. Moore, J E. Morton, Judith 
Moss, F Mosteller, Charles M. Mottley, Ray B. Murphy, P. M. Neurath, Hugo Nilson, 
Gottfried E Noether, Monroe L. Nordon, H. W. Norton, C O. Oakley, P. B. Olmstead, 
J.G Osborne, Ellis It Ott, C. J Ileos, W. A. Reynolds, A. C. Itosandor, David Rosenblatt, 
Ernest Rubin, P. U. Rulon, Frank Saidcl, Marian M. Sandomiro, Walter A. Showhart, 
James G. Smith, Milton Sobol, Herbert Solomon, Mortiner Spiogclmon, Charles M. Stein, 
G. R. Stibitz, John It. Tomlinson, Marion M. Torrey, John W, Tukey, D. F. Votaw, Jr., 
F. M. Wadley, Alton J. Wadman, A. Wald, Robert M. Walter, Lionel Woisa, Frank Wil- 
cQxon, S. S. Wilks, C, P, Winaor, J. Wolfowitz, and W. J. Youdon. 

At the morning aession the following program was presented with Professor 
S, S. Wilks of Princeton University as chairman: 

Topic Malhmatical Approaches to the Analysis of Variance 

Papers: Two Probability Models for the Analysis of Variance 

Professor A. Wald, Columbia University 
Applications of Analysis of Variance 

Professor M. S. Bartlett, Cambridge University and The University of 
North Carolina 

Discussion' Professor S. L. Crump, Iowa State College 
Dr. J, F. Daly, Bureau of the Census 
Professor J. W. Tukey, Princeton University 
Professor C. P. Winsor, Johns Hopkins University 
Professor J WoICowitz, Columbia University 

The program for the afternoon session, under the chairmanship of Professor 
Will Feller of Cornell University, was as follows: 

Topic: Mxdtivanaie Problems in the Analysis of Variance 

Papers: Analysis of Couariance 

Professor W. G. Cochran, The University of North Carolina 
Feefor Methods 

Professor J W. Tukey, Pnneeton University 
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Discussion. Professor T W Andeison, Columbia University 
Professor C I. Bliss, Yale University 

Professor Harold Cramer, The University of Stockholm and Princeton 
Uiuversity 

ProfessorD. B.DeLury, Virginia Polytechnic Institute 
Professor P L. Hsu, The University of North Carolina 

The evening session consisted of round table discussion on Unsolved Problems 
of the Analysis of Variance, with Professor Gertrude M. Cox as chairman. 

Members of the Institute and others who attended the meeting were guests 
of the Institute for Advanced Study at tea in Puld Hall from 4 to 6 P.M. Those 
attending the evening session were guests of Princeton members of the Institute 
for refreshments in Fine Hall from 10 to 11 P.M. 

P. S. Dwyer, 
Secretary. 



MEMBERS OF THE INSTITUTE OF MATHEMATICAL STATISTICS* 

(Ai of October 1, 191)6) 

(The names of Fellows of the Institute are designated by * and Life Members by f) 
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Pa., 101 W. Broadway, Salem, N J. 

Altchlson, Beatrice Ph.D (Johns Hopkins) Econ. and Stat. Analyst, Interstate Com¬ 
merce Comm., Wash. 26, D. C., 19IS9 S St., N-W., Wash. 0 
Alchlan, Asst. Prof. Armen A. Ph.D. (Stanford) Econ. Dept, Univ. of Calif., Los 
Angeles, Calif. 

Alger, Philip L. M.S (Union) Staff Asst, to Mgr. of Engr Apparatus Dept, General 
Elec Co,, 1 River Rd., Schenectady, N Y , 1T68 Wendell Ave , Schenectady 8 
Allen, Prof. Roy 6. D. D Sc. (London) London School of Econ , Houghton St., Aldwych, 
London, W C. 2, Eng , 11 Christchurch PL, Epsom, Surrey 
Allendoerfer, Prof. Carl B. Ph.D. (Princeton) Hnverford Coll., Haverford, Pa., 750 
Rugby Rd., Bryn Jl/amr 

Alt, Franz L, Ph.D. (Vienna) Asst Dir. of Rea., Econometric Inst., 600 Fifth Ave., 
N. Y 18, N Y., S71 Fori Washington Ave , N Y.SS 
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Anderson, Paul H. Ph.D (Illinois) Economist, War Assets Adm., Wash,, D. C , 1SS8 
Blair Mill Rd., Silver Spring, Md, 

Anderson, Aaso. Prof. Richard L. Ph.D. (Iowa State) Inst, of Stat., N, C. State Coll., 
Raleigh, N, C., Box 5494, Stale Coll. Station 

Anderson, Theodore W., Jr. Ph.D. (Princeton) Instr., Dept, of Math. Stat, Columbia 
Univ., N. y. 27, N. Y , S455 Lawndale, Evanston, III 
Angell, Dorothy T. Stat. Analyst, Bell Tel. Labs., Murray Hill, N. J 
Anscombe, F. J. Rothamsted Experimental Station, Harpenden, Herts, Eng 
Arias, B. Jorge C.E. (Guatemala) 3 Avemda Sur 65, Guatemala City, Guatemala, C. A. 
Arnold, Prof. Herbert E. Ph D (Yale) Wesleyan Umv., Middletown, Conn , 167 High 
St 

Arnold, Asst. Prof. Kenneth J. Ph.D. (Mass Inst Tech.) Dept, of Math., Univ. of 
Wis., Madison 6, Wis , 7SS E Johnson St, Madison S 
Aroian, Leo A. Ph D. (Michigan) Instr., Hunter Coll, N Y., N. Y , 1S47 Wadsworth 
Ave., N. Y S3 

Arrow, Kenneth J. M A. (Columbia) Lydig Fellow, Columbia Univ., N. Y. 27, N. Y., 
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* Members were asked to supply fresh information for this Directory. The name is 
followed by highest degree and Institution granting it. Then follow the professional and 
business connections of the member, with business address, and finally (in italics) the home 
or mail address. When an address is known to be m error it is followed by (last address). 
Changes in addresses or errors in names, titles, or addresses, should be reported to the 
Secretary 
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Bachelor, S.o1)ert'W. M.B A. (Washington) Dir of the Res. C!ouncil, American Bankers 
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Beall, Geoffrey Ph.D. (London) Res. Asso , Inst, of Paper Chemistry, Appleton, Wis., 
160S N, Meade Si. 

Bechhofer, Robert E, A B. (Columbia) Grad. Student, Columbia Umv , N. Y., N. Y., 
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Victoria, Australia 



620 


MEMBER?; OF THE INSTITUTE 


Benford, Frank B.E.E. (Michigan) Physicist, IBJfi Rugby Rd., Schenectady 8, N. Y. 
Bennett, Prof. Albert A. Ph.D. (Princeton) Brown Univ., Providence, R. I. 

Bennett, Blair M. M.A. (Columbia) Res. Asst., U, S. Dept, of Agric , Wash., D. C., 
1410 M St., N.W. 

Bennett, Carl A. A.M. (Michigan) Grad. Fellow, Dept, of Math,, Univ. of Mich., Ann 
Arbor, Mich., P.O. Box 3, Saline 

Berger, Richard M.A. (Columbia) Res. Analyst, Dun and Bradstreet, 290 Broadway, 
N. Y., N. Y., 35 Rugby Rd., Rockmlle Centre 
Berkson, Joseph M.D., D.Sc, (Johns TIopkina) Chief, Div. of Biometry and Med. Stat., 
Mayo Clime, Rochester, Minn,, 457 14th, Ave., S.E. 

Berman, Abraham J. M.A. (Brooklyn) Stat., N. Y State Dopt. of Labor, 80 Center 
St I N. y., N, Y., 14BO College Ave., Bronx 

Berwick, Leonard A.B. (New York) Capt., AC, AAF School of Aviation Medicine, 
Randolph Field, Texas 

Bltkerstaff, Asst. Prof. Thomas A. M A. (Mississippi) Univ. of Mississippi, State 
College, Miss. 

Bigelow, Julian H. 631 W. ISSnd St, N. Y. 37, N. Y. 

Bingham, Marlon D. B.A. (George Washington) Acting Chief, Res Div , Veterans 
Adm. Branch Office No. 9, 314 N. Broadway, St. Louis, Mo , 3908 Biler St. 

Blmhaum, Asso. Prof. Z. William Ph.D. (Lwow) Univ. of Wash., Seattle B, Wash., 
B7SS Slsi Ave., N.B. 

Blackadar, Walter L. F A.S , F.A.I A,, B A. (McMaster) Asso. Actuary, Equitable 
Life Assurance Soo. of the U. S , 393 Seventh Ave., N. Y. 1, N Y., 40 Hillcretl Rd,, 
R D. HI3, Plainfield, N. J 

Blackburn, Prof. Raymond F. Ph.D, (Pittsburgh) Head, Dept, of Stat., Univ. of Pitts¬ 
burgh, Pittsburgh 13, Pa. 818 Country Club Dr,, Piileburgh 16 
Blackwell, Asso. Prof. David H. Ph.D. (Hhnois) Howard Univ., Wash., D. C., 3733 
Jay St, N.E. 

Bloke, Archie Ph.D. (Chicago) Sr. Stat., Ofliee of the Army Surgeon General, Wash. 
26, D. C., 3300 19th St., N.W., Wash. 10 

Blanche, Ernest E. Ph.D (IHinois) Prin. Adm. Analyst, War Dept. General Staff, 
43666 Pent^on Bldg., Wash. D. C. 0409 Montgomery Avenue, Chevy Chase IS, Md. 
*B11bb, Chester I. Ph.D, (Columbia) Biometrioian, Conn. Agrio. Experiment Station, 
Asso. Prof, of Biometry, Yale Univ., New Haven, Conn., 33 Edgehill Rd. 

Blommers, Asst. Prof. Paul J. Ph.D, (Iowa) Asst. Prof, and Registrar, State University 
of Iowa, Iowa City, Iowa, 3 Woolf Court 
Bloom, Rose B A. (Hunter) 1916 (hand Concourse, Bronx 67, N. Y. 

Bloom, Royal F. M.A. (Minnesota) Asst. Head, Classification Res., Bur. of Naval Per¬ 
sonnel, Navy Dept,, Wash. 26, D. C., 15 Hillside, Oreenbell, Md. 

Boddle, John B. Chief, Govt Sec , International Economics, Dept, of Commerce, Wash¬ 
ington, D. C. 3638 Tunlavo Rd., N W., Wash. 7 
Bonis, Austin J. B.S. (C.CNY.) Major, War Dept. Gen, Staff, Wash., D. C., 3600 
Que St., N.W. 

Bonnar, Robert U. M.S. (Washington) Chemist, Shell Development Co., 4660 Horton 
Ave , Emerjrvrille, Cahf,, 3969 Brook Way, San Pablo 
Boozer, Mary E. A.M, (Chicago) 90$ Park Ave., Richmond 35, Va. 

Borland, James M A (Indiana) 6733 Central Ave., Indianapolis, Ind. 

BoBchan, Paul Ph.D. (Vienna) Chief Analyst, The Econometno Institute, Ino,, 600 
Fifth Ave., N. y.48, N. Y., I04 West 40 St., N. Y. 18 
fBowen, Earl K. AM (Boston) Instr in. Stat., Babson Inst, of Bus. Adm., Babson 
Park, Wellesley Hills, Mass , 348 Union St., Norwood 
Bower, Oliver K. Ph.D. (Illinoia) Asso., Univ. of HI., Urbana, HI., SOS W. John, 
Champaign 



MEMBERS OP THE INSTITUTE 


521 


fBowker, Albert H. S.B. (Mass. Inst. Tech.) Student, TJniv of North Carolina, Chapel 
Hill, N. C., tOIf Gimghoul Rd. 

Brady, Dorothy S. Ph.D. (California) Chief, Cost of Living Div , Bureau of Labor 
Stat., Wash. 26, D. C , Bt7S Fulton Si., N W., Washington 13 
Brandt, Alva E. Ph.D. (Iowa State) Res, Specialist in Experimental Design and Anal¬ 
ysis, Soil Conservation Service, Dept of Agnc , Wash 26 D. C , Routs S Box 135, 
Vienna, Va, 

Brearty, Charles R. B.S. (Califorma) Member of Technical Staff, Bell Telephone Labs., 
Inc., 463 West St., N. Y. 14, N. Y., m-iO iUi Am., Little Neck, N. Y 
Breden, Robert E. B.S (Kansas State) Supv , Analysis and Records Div , The Proctor 
and Gamble Co., Sixthand Main Sts , Cincinnati 1, 0.,64S8 Mayflower Ave., Cincinnati 
U 

Breen, Nancy Brlxey (Mrs. J. P ) A B. (Vassar) 70 East 77 St., N. Y. 21, N. Y. 
Bresnahan, Maurice F. Stat, IJ, S Bur. of Labor Stat., Washington, D. C., Ayt. SOB, 
1015 N St., N.W. Wash. 1 

Brldger, Clyde A. M.S (Oregon State) Instr. in Math., Univ of Utah, Salt Lakb City, 
Utah (on leave), Instr. of Stat., State College Station, Raleigh, N. C. 

Brier, Glenn A.M. (George Washington) Meteorologist, U. S. Weather But., Wash. 
25. D. C 

Brlxey, Asso. Prof. John C. Ph.D. (Chicago) Umv of Okla , Norman, Okla. fl.®?'N Pick¬ 
ard St., Norman 

Bronfenbrenner, Martin Ph.D. (Chicago) Lt (] g ) USNR, Office of CINPAC, o/o 
Postmaster, San Francisco, Calif. 728 N, First Ave , Tucson, Arii 
Brookner, Ralph J. Ph.D. (Columbia) 1330 Broadway, Beaumont, Texas 
Brooks, Alvin G. B.A (Ripon) Chief of Inspection Tasks Sec., Western Elec. Co., 
Hawthorne Sta., Chicago, Ill. 4338 Lawn Avenue, Western Springs 
Brown, Asst. Prof. Arthur B. Ph.D. (Harvard) Queens Coll, Flushing, N. Y. 155-01 
SOth Ave., Apt 4F, Jamaica 2 

Brown, Arthur W. A.B (Pnnoeton) Standard Oil Co , Rm 3101, 30 Rockefeller Plaza, 
N. Y,, N. Y. 25 Wilner St, Madison, N. J 

Brown, Res. Asso. Prof. George W. Ph.D. (Princeton) Stat. Lab , Iowa State Coll. 
Ames, Iowa 

fBrown, Richard H. A B. (Columbia) Lecturer in Math., Columbia Univ., 531 W. 116th 
St, N. Y. 27, N. Y , 1310 John Jay Hall, Columbia Univ., N. Y. 27, N. Y. 

Brown, Prof. Theodore H. Ph.D. (Yale) Dept of Bus. Stat., Harvard Univ., Grad. 

School of Bus. Adm., Soldier’s Field, Boston 63, Mass. 25 Meadow Way, Cambridge 
Brumbaugh, Prof. Martin A. Ph.D (Pennsylvama) Dept, of Stat., Univ. of Buffalo, 
Crosby Hall, Buffalo 14, N. Y. 

Bruner, Nancy M A (Iowa) Stat., Western Auto Supply Co., 2107 Grand Ave., Kansas 
City 8, Mo , 7511 Mam Street, Kansas City 5 
Bruyere, Martha C. M D (Chicago) Stat, U, S Public Health Service, Bethesda 14, 
Md , Gaithersburg, Md. 

Bruyere, Paul T. MD (Chicago) Stat., U. S Public Health Service, Bethesda 14, 
Md , Gaithersburg, Md. 

Bryan, Joseph G. Ed M. (Harvard) Staff Member, Div. Ind. Coop., Mass. Inst. Tech. 
Cambridge, Mass., 97 Green St., Melrose 

Budne, Thomas A. M.A. (N. J. State Teachers Coll.) Instr.of Math., N. J. State Teach¬ 
ers Coll., Upper Montclair, N. J., 2038 78th St., Brooklyn 14, N. Y. 

Bunke, Alfred M.A. (Columbia) Sr. Stat., N. Y. State Dept, of Labor, Albany, N. Y., 
37 Parkwood St., Albany 3 

Burgess, Robert W. Ph.D. (Cornell) Chief Econ., Western Elec. Co., 196 Broadway, 
N. Y. 7, N. Y. 



622 


MEMBERS OP THE INSTITUTE 


Burlngton, Richard S. Ph D. (Ohio State) Chief Math., Dir. of Evaluation and Analysis 
Group, Bureau of Ord , Navy Dept. Wash. D. C., BSOO N. Carlin Sj). Rd , Arlington, 
Ya. 

Burkf Marjorie F. B.A. (Hunter) Srd Si, N.E., Wo«/i. D. C 

Burke, H. D. Chief of Inspection and Qual. Control, The Coleman Co,, Inc., Wichita 1, 
Eans. 

Buros, Oscar K. M.A (Columbia) Rutgers Univ , New Brunswick, N. J., 8J80 Monf- 
gomery St., Highland Park 

Burr, Asso. Prof, Irving W. Ph.D. (Michigan) Dept, of Math., Purdue Unvv., W. Lafay¬ 
ette, Ind., SBS Liiilelon Si. 

Bushey, Xaao. Prof. J. Hobart Ph.D. (Michigan) Dept, of Math., Hunter Coll , 695 
Park Avo , N Y. 21, N Y , BOl W llSlh St, N. V. SB 

♦Camp, Prof. Burton H. Ph.D. (Yale) Math Dept., Wesleyan Univ., Middletown, 
Conn , 110 Mi. Vernon Si. 

Campbell, Prof, Frances L. Ph.D. (Michigan) Dept, of Math., George Pepperdine Coll., 
1121 W. 79th St., Los Angeles 44, Calif. 

Campbell, George C. M.S. (Iowa) Supervisor in Actuarial Div , Metropolitan Life Ins. 
Co , N. y. 10, N. Y., Troy Rd., RFD §1, Boonion, N. J. 

Campbell, James T. Ph D. (Edinburgh) Univ. Math. Lecturer, Victoria Univ. Coll., 
W 1, New Zealand 

Cannon, Edward W. Ph.D. (Johns Hopkins) Comdr., U. S Navy, Res and Standards, 
Branch of Bureau of Ships, Cannon, Del. 

Canter, Stanley D. B.S. (CON Y.) Cpl. U S. Army, 1st AAFDU, Sqdn “A”—Bolling 
Field, Wash , D C., £676 Morris Ave., Bronx B8, N Y 

Caplan, Benjamin Ph.D (Chicago) Econ , OPA, Wash,, D. C. £8S1 28th St., N.W., 
Wash. 8 

Capo, Bernardo G. Ph.D. (Cornell) West Copake, N. Y, 

Carlson, John L. M.A. (Stanford) Lt. Comdr. USNR, Dept 2B-Navy 3237-FPO, San 
Francisco, Calif. 

Carlton, A. George B A. (Gustavus Adolphus) 262 W 102nd St., N Y. 25, N. Y. 

Carter, Gerald C. Ph.D, (Purdue) Supv. of Training and Activities, Univ. of III., 
Urbana, Ill. 

Carvalho, Prof. Pedro Egydlo de OUvelrn Ph.D. (Sao Paulo) Faculdado de Higiene, 
Univ. of Sao Paulo, Avemda Dr Arnaldo 85, Caixa Postal 99-B, Sao Paulo, Braiil 

f*Carver, Prof. H, C. Ph.D. (Michigan) Dept, of Math., Univ. of Mich., Ann Arbor, 
Mich., 3452 Richard, PtUsfield Village 

Casanova, Teobaldo Ph.D (New York) Res. Stat, Inst, of Legal-Soo. Res , Univ. of 
Puerto Rico, Rio Piedras, Puerto Rico. 

Cederberg, Prof. William E. Ph.D (Wisconsin) Augustana Coll., Rook Island, HI., 
£S4£ ££i Ave 

Chances, Ralph B B.S. (C.C.N.Y.) 46 W. 83rd St., N. Y., N. Y. 

Chang, Calvin C. M.A. (Michigan) Public Acct., 132 W. First St., Los Angelos, Calif., 
l£9i W, Santa Barbara Ave. 

Chang, Z. T. A,M (Columbia) 7 Lane 720, Avenue Fooh, Shanghai, China 

Chapman, Roy A. B.S. (Minnesota) Silviculturist, Div. of Ec., U, S. Forest Servioe, 
Wash., D. C. 

Chassan, Jack B.S (New York) Stat, Medical Stat., Div. S. G. 0. War Dept., 2B-540 
Pentagon Bldg., Arlington, Va., 3013 SOth St, S E., Wash. £0, D C. 

Chen, Way Ming Ph.D, (California) Instr. in Math., Brown Univ., Providence 12, R. I. 

Christopher, Edward E. B S (Mass. Inst Tech.) Stat., War Dept., Wash , D. C., 
S704 N. £6lh St., Arlington, Fa 

Chung, Kal-Lal M.A (Princeton) Graduate College, Princeton Univ., Princeton, 
N. J. 



MEMBEIia OE THE INSTITUTE 


623 


Church, Asso. Prof. Randolph Ph.D. (Yale) Postgrad. School, U. S. Naval Academy, 
Annapolis, Md., 316 N. Olen Ave. 

Churchill, Edmund A.M. (Columbia) Rutgers Univ., New Brunswick, N. J, 

Churchman, Asst. Prof. C. West Ph.D. (Pennsylvania) Philosophy Dept., Univ. of 
Pa., Philadelphia 4, Pa., 318 S. hemtnger St. 

Clark, Prof. Andrew G. M A. (Colorado) Dept, of Math., Colorado A A M Coll., Ft. 
Collins, Colo,, 631 Whedhee St 

Clarke, P. C. Asst. Gen. Mgr , Huntor Pressed Steel Co., Lansdalo, Pa., Dine Lexington, 
Pa. 

Clarkson, Prof. John M. Ph.D. (Cornell) Dept, of Stat., N. C. State College, Raleigh, 
N C , Stale College Station 

Clifford, Asst. Prof. Paul C. M A. (Columbia) State Teachers Coll., Montclair, N. J., 
6 J 1 .I Upper Mountain Ave. 

CUnedinst, William O. Mech. Eng. (Carnedie Inst, of Tech.) Dir. of Res., National 
Tube Co , 1702 Frick Bldg., Pittsburgh, Pa., 40 O Thomycroft Ave., Pitleburgh 16 

Cloudman, Charles G. M.So (Rhode Island State) Ebaaco International Corp., 2 Rector 
St , N. Y., N. Y , Towers Hotel, ZB Clark St., Brooklyn 

Cobh, William J. Stat., Census Bureau, 4036 8th St., N.E., Wash., D. C. 

*Cochran, William G. M A (Cambridge) Asso Dir., Inst, of Stat, N. C. State Coll., 
Raleigh, N, C , 1003 Brooks Ave. 

Cody, Blanca R. M.A. (Columbia) Stat., Market Res., U. S Rubber Co., 1230 Ave. 
of the Americas, N Y., N. Y., B6^ Auduban Ave. JV. Y. 33 

Cody, Donald D, A,B. (Harvard) F A.S, F A I.A, Asst Actuary, Equitable Life Aasur- 
anoo Soc., 393 Seventh Ave , N Y 1, N. Y., 308 W. I 04 St, N. Y SB 

Coggins, Paul P. AM (Harvard) Supv. Acet., Amor. Tel. and Tel Co., 196 Broadway, 
N. Y., N. Y. 

Cohen, Alonzo C. Ph D. (Michigan) Lt Col, Army Univ. Study Center %2, APO 772, 
c/o Postmaster, N. Y , N Y 

Cohen, Josef Ph.D (Cornell) Instr. in Psychology, Cornell Univ., Ithaca, N. Y , 
1 E. Ave 

Cohen, Karl Ph D. (Columbia) Standard Oil Dev. Co , Box 243, Elizabeth, N. J. 

Coleman, Asst. Prof. Edward P. MS (Iowa) Univ. of Omaha, Omaha, Nebr. (on leave) 
Instr in Math., U 8. Military Academy, West Point, N. Y. 

Coles, Asst. Prof. James S. Ph D (Columbia) Dept, of Chemistry, Brown Univ., 
Providence, R. I 

Coon, Helen J. M A (Southern Methodist) Ballistics Res. Lab., Aberdeen Proving 
Ground, Md. 

Cooper, William W. A.B (Chicago) Instr. in Econ., Univ of Chicago, Chicago 37, Ill., 
6BS9 S. Ellis Ave 

Cope, Asso. Prof. T. Freeman Ph D. (Chicago) Math. Dept., Queens Coll., Flushing, 
N Y , Montrose, N. Y. 

*Copeland, Prof. Arthur H. Ph.D. (Harvard) Dept, of Math., Univ. of Mich., Ann 
Arbor, Mich , 616 Oswego St 

Copp, Warren F. B.S (Ohio State) Supv., Quality Control Dept., Wheeling Steel Corp., 
Yorkville Works, Yorkville, Ohio 

Cornell, Francis G. Ph D. (Columbia) Chief, Res. and Stat Service, U. B. Office of 
Educ , Wash 25, D. C , 113 Soulhbrook Lane, Betheada I 4 , Md. 

Cornfield, Jerome B.S'. (New York) Stat, Dept, of Labor, Wash. D, C., R.F.D. §S 
Herndon, Va 

Cotterman, Asst. Prof. Charles W. Ph D (Ohio State) Asst. Prof, of Zoology & Asst. 
Geneticist, Vertebrate Biology Lab, Univ. of Mich., Ann Arbor, Mich., S71B Brock¬ 
man Blvd. 

Court, Dr. Louis M. Ph D, (Columbia) Multitrade Ltd., 141 Broadway, N. Y, 6, N. Y. 



624 


MEMBERS or THE INSTITUTE 


Cowan, Donald R. G. Ph.D. (MinneBOta) Donald R. G. Cowan and Aaaooiatea, 1216 
Citizens Bldg., Cleveland 14, Ohio 

Cowden, Prof. Dudley J. Ph.D. (Columbia) Dept, of Econ. Stat,, Univ. of N. C,, 
Chapel Hill, N. C., Country Club Road 

Coi, Gerald J. Ph.D. (Dlinoia) Rea. Chomist, Corn Prod. Refining Go., Argo, Ill,, 
BOO S. 7lh Ave., La Orange 

*Cot, Gertrude M. M.S. (Iowa State) Dir., Inst, of Stat., N. C. State Coll., Raleigh, 
N. C. 

♦Craig, Prof. Allen T. Ph.D. (Iowa) Dept, of Math., Univ. of Iowa, 208 Physios Bldg., 
Iowa City, Iowa 

♦Craig, Prof. Cecil C. Ph.D. (Michigan) Prof, and Dir. of Stat, Ree Ijab , Univ. of Mich., 
Ann Arbor, Mich., 1410 Irogouie 

♦Cramdr, Prof. Harald Ph.D. (Stockholm) Inst, of Math. Stat., Univ. of Stockholm, 
Norrtullsgatan 16, Stockholm, Sweden, Skdroikavagen 7, Djursholm (On leave from 
Oct. 1, Princeton Umv., Princeton, N. J.) 

Crawford, Ellzaheth S. B.S. (Mundelein) Stat., War Assets Adm , Commonwealth 
Bldg , Denver, Colo, S878 Ash St. 

Crawford, Janies R. Dept. Mgr., Master Scheduling, Lockheed Aircraft Corp , Burbank, 
Calif., lieSB Kitlridge Si , N Hollywood 

Cruden, Dorothy A.B. California) SI Chabot Terrace, San Francisco 18, Calif. 

Crump, Asst. Prof. S. Lee B.S. (Cornell) Stat. Lab., Iowa State Coll., Amos, Iowa 

♦♦Cudmore, Sedley A. M.A. (Oxford) Dominion Stat., Dominion Bur. of Stab , Ottawa, 
Ont., Can. 

Cureton, Edward E. Ph.D. (Ckilurabia) Secy.-Treaa., Richardson, Bellows, Henry & 
Co.j Inc., 66 Beaver St., N. Y. 4, N. Y., I6i Larchmonl Ace., Larohmonl 

Curry, Prof. Haskell B. Ph.D. (Gdttingen) Dept, of Math , Pa. State Coll., State Col¬ 
lege, Pa., BS8 E. Prospect Ave. 

♦Curtiss, JohnH. Ph.D. (^rvard) Asst, to Dir., Natl. Bur. of Standards, Wash., D. C., 
480S Bradley Blvd., Chevy Chase, Md. 

Cynamon, Manuel M.S. (C.C.N.Y.) Res. Asso., Amer. Inst, for Res., Univ. of Pitts¬ 
burgh, Pittsburgh 18, Pa. 

♦Daly, Joseph F. Ph D (Princeton) Stat., Bur. of the Census, Wash. 25, D. C., 4B1S 
lath SI., N.E., Wdsh. 18 

Daniel, Cuthbert M.S. (Mass. Inst. Tech.) Stat. Engr., Carbide & Carbon Chem. 
Corp., Oak Ridge, Tenn., 4^0 E. Drive 

Dantzlg, George B. M.A. (Michigan) c/o F. Shmuner, 3705 Edmondson Ave., Baltimore, 
Md. 

Darkow, Asso. Prof. Marguerite D. PhD. (Chicago) Dept, of Math., Hunter Coll., 
686 Park Ave., N. Y. 21, N. Y , 16 E. S«»d St, N. Y. 88 

♦David, Florence N. Ph.D. (London) Lecturer, Dept, of Stat., University Coll., London, 
W.C. 1, Eng., 88 Torringfor Square, London W.C. 1. 

Davidson, James H. M.B. (Va. Polyteoh. Inst.) Res Asst., Grad. Student, Princeton 
Umv., Princeton, N. J., Frick Chemical Lab. 

Day, Besse B. AM. (Miolugan) Applied Physios Lab., The Johns Hopkins Univ., 
8621 Georgia Ave., Silver Spring, Md , Sise Dumbarton Ave., N.W., Wash. 7, D. C. 

Deemer, Walter L., Jr. Ed.D, (Harvard) Lt, Col., AC, School of Aviation Medicine, 
Randolph Field, Texas 

De Oarls, Prof. Charles F. Ph.D, (Johns Hopkins) Dept, of Anatomy, School of Medi¬ 
cine, Umv. of Okla , 801 N.E 13th St., Okla. City 5, Okla., 1108 N.E. 16lh St. 

Delhi, D, George M.A (Drake) Stat., National Cancer Inst,, U. S. Public Health 
Service, Bethesda, Md., S89B Porter St., N.W., Wash. 16, D. C. 


♦♦ Deceased. 



MEMBERS OF THE INSTITUTE 


525 


de Loor, Prof. Barend Ph.D. (Amaterdam) Umv of Pretoria, Pretoria, Union of South 
Africa 

Belsa, Alexis A I Lg. (Liege) Mgr Basic Bessemer Steelworks, Societe Anonyme John 
Cockerill, Seraing, Belgium 

DeLury, Prof. Daniel B. Ph D (Toronto) Dept, of Stat., Va. Polyteoh Inst., Blacks¬ 
burg, Va. 

t*Denilng, W. Edwards Ph.D. (Yale) Adviser in Sampling, Bur of the Budget, Wash 
26, D. C., SuUeTWOTih PL, Wash. 16 

Dempsey, Rev. Bernard, S.J. Ph.D. (Harvard) Asso Prof and Regent, School of Com¬ 
merce and Finance, St. Loms Univ., 3674 Lindell Blvd., St. Louis, Mo 
Densen, Asst. Prof. Paul M. D So. (Johns Hopkins) Vanderbilt Univ Med School, 
Nashville 4, Tenn, 

Derrick, Asst. Prof. Luclle M.A. (Peabody) School of Business, Univ of Chicago, Chi¬ 
cago 37, Ill , 6S4S KivibaTk 

Dletzold, Robert L. S. B (Maas Inst Tech ) Member Tech Staff, Bell Tel Lab , 
Murray Hill, N J. 

*Dleulefait, Prof. Carlos E. Dir., Instituto Estadiatica, Umv Nacional del Litoral, Pueyr- 
redon 1235, Rosario, Argentina 

Dlmsdale, Bernard Ph D. (Minnesota) c/o Letbomlz, Ills Grand Concourse, N, Y. 
es, N Y 

Divatla, M. V. Dept, of Industries and Civil Supplies, New Delhi, India 
Dlvatla, Vasishtha V., B Sci (Bombay) Student in Math. Stat., Columbia Univ., 
at tSIf John Jay Hall, Columbia Univ,, N Y Cily 
Dix, Margaret J. M S (Rice Inst ) Secy., Stat. Lab, Umv of Calif., Berkeley 4, 
Cahf 

Dixon, Asso. Prof. Wilfrid J. Ph D (Princeton) Univ. of Oregon, Eugene, Oregon 
*Dodge, Harold F. A M. (Columbia) Qual Results Bngr, Bell Telephone Labs., 463 
West St, N Y. 14, N Y , 96 Briarcliff Rd., Mountain Lakes, N. J. 

Dominguez, Emilia A. Ec S (Buenos Aires) Actuary, Supt Personas Jundicas de 
Buenos Aires, Marlines Castro 765, Buenos Aires, Argentina 
Dominguez, Jose F. Eo.S. (Buenos Aires) Tech. Council Inst. Nacional de Prevision 
Social, Martinez Castro 765, Buenos Aires, Argentina 
♦Doob. Prof. Joseph L. Ph.D (Harvard) Umv. of Ill, Urbana, Rl , 108 W High St 
Dorfman, Robert M A (Columbia) Operations Analyst, Hq Army Air Forces, The 
Pentagon, Wash. 25, D. C , 14SS Girard Si , N,W , Wash. 9 
Dorn, Harold F. Ph D. (Wisconsin) Lt. Col , U. 8 Public Health Service, Wash. 26, 
D C., 15 Burning Tree Court, Bethesda 14, Md 
■fDorweller, Paul B.S (Iowa) Actuary, Aetna Casualty & Surety Co., Hartford 15, Conn. 
Dresch, Francis W. Ph D. (California) Lt Comdr , USNR, Naval Proving Gd , Dahl- 
gren, Va. 

Dressel, Paul L. Ph D (Michigan) Chm , Board of Exanuners, Dir. of Counseling, 
Mich State Coll., E Lansing, Mich., 165 Milford St 
Duncan, Asso. Prof. Acheson J. Ph.D. (Princeton) Dept of Political Economy, The 
Johns Hopkins Umv., Baltimore, Md., c/o Mrs Joseph Foster, 4406 Roland Ave. 
Duncan, David B. B Sc (Sydney) Grad Student, Stat Lab , Iowa Slate Coll , Ames, 
Iowa 

Dunlap, Jack W. Ph D (Columbia) Dir., Div of Biomechamcs, Psychological Corp., 
622 Fifth Ave , N. Y., N. Y 

Durand, David PhD (Columbia) Natl. Bur of Bcon Res , W.264th St and Independ¬ 
ence Ave., N. Y. 63, N. Y, 

Dutka, Jacques Ph.D (Columbia) 740 Gerard Ave., Bronx 51, N Y, 

♦Dwyer, Prof. Paul S. Ph D (Michigan) Umv. of Mich , Ann Arbor, Mich., 640 Oxford 
Rd. 



526 


MEMBEHS OF THE INSTITUTE 


Dyson, John D. B S. (S. Dak. State) Major, U. S. Army, Patient Fitzsimmons Gen. 
Hosp , Denver, Colo., 108 S. Jefferson, Pierrit, S. Dak. (Last address) 

Eaves, James C. M.A. (Kentucky) Instr., Math. Dept., Umv. of N C., Chapel Hill, 

N C. (Last address) 

Echegaray, Miguel de Ag. Attaehe to the Spanish Embassy, 2700 16th St., N.W., Wash., 

D C 

Bde, Richard B S. (Wisconsin) Cliem Dovel. Metallurgist, Gary Works, Car. Steel 
Ill , Sp FillmoTe Si , Gary, Ini. 

Edgett, Asso, Prof. George L. Ph.D. (tllinoia) Dept, of Math., Queen’s XJmv., Kingston, 
Out., Can , 41 TraytTMon Ave. 

t*Elsenhart, Asso. Prof. Churchill Ph D. (London) Biometrieian, Math. Dept , Umv. 
of Wis Biometry and Physics Sec , Wis. Agn. Exp. Sta., Madison 6, Wis., 188g 
Monroe St., Madison B 

Elconin, Victor M S (Calif. Inst. Tech.) Asso. Physicist, Calif. Inst. Tech ; Tech 
Dir , Airtronica Mfg Co.; Tech. Dir., Electronic Industries Tech. Inst., Los Angeles, 
Calif., 740 Cordova Ass., Glendale 6 

Eldredge, George G. PhD. (Minnesota) Chemist, Shell Development Co., Emeryville, 
Calif., 8743 Greenwood Dr., San Pablo 

Elkin, William F. M S.P.II. (Michigan) Stat., Dept, of Health, P 0. Box 486, Oak 
Ridge, Tcnii., 40 S Pennsylvania Ave. 

Elkins, Thomas A. AM (Princeton) Geophysicist, Gulf Res. & Development Co., 
P.O. Drawer 2038, Pittsburgh 30, Pa., S85S Parkview Ave., Pillsburgh IS 

Ellis, Wade Ph D. (Michigan) Radiation Lab., Maas. Inst. Tech , Cambridge, Mass., 
1BB6 Cambridge SI. 

Elmore, Francis B. B.S, (Clornson) Qual Control Engr., Union Bag and Paper Co , 
Savannah, Ga., 1311 E. BBlh Si. 

Elston, James S. A.B. ('Cornell) Asst. Actuary, Travelers Ins. Co., Hartford, Conn. 

Eltlng, John P. M.S. (Mass. Inst. Tech.) Dir. of lies., Kendall Mills, Paw Greek, N. C., 
ISO 4 DiUmore Dr., Charlotte 

Blveback, Lillian R. B,A. (Minnesota) Instr., Biostatistics Dept, School of Public 
Health, Columbia Umv., 600 W. 168th St., N. Y 32, N, Y 

Elveback, Asst. Prof. Mary L. M A, (Minnesota) Rockford Coll., Rockford, Ill. 

Epstein, Benjamin Ph.D. (Illinois) Staff Asst., Wostinghouae Eloc. Corp., E. Pitts¬ 
burgh, Pa., 833 Eivermonl Dr , Pillsburgh 7 

Eudey, Mark W. A.B. (California) Lecturer in Math, and Res. Asst., Stat. Lab., Umv. 
of Calif,, Berkeley, Calif , 886 Santa Barbara Rd,, Berkeley 7 

Evans, Prof. Herbert P. PhD. (Wisconsin) North Hall, Umv of Wis , Madison 6, 
Wis, 

Evans, Wilmoth D. B,S. (Clarkson Coll. Tech.) Chief, Productivity and Tech Devel. 
Div., U. S. Bur of Labor Stat, Wash. 25, D. C , 409 N George Mason Dr., Arlington, 
Va 

Evensen, Edward J. Metropolitan Life Ins. Co., San Francisco, Calif , 180 Holloway 
Ave,, San Francisco 18 

Evers, DlUon Ph.D. (Iowa) Capt., Otd. Dept, St, Louis Ordnance Plant, 4300 Good- 
fellow Blvd., St. Louis 20, Mo. (Last address) 

Ewart, Robert B A. (Now York) Res. Physicist, 6881 Braun St , Centerline, Mich. 

Fadner, Raymond H. B.A. (Minnesota) Commodity Records, Div Dep. Director, 
UNRRA, 1344 Connecticut Ave , N.W., Wash., D.C,, 8S18 nth St., N.W , Wash. 9 

Fanshaw, Hugh L. M So. (Mamtoba) Standards Supv., Gen. Chem Div., Canadian 
Indus. Ltd., Hamilton, Ont., Can,, 180 St Clair Ave. 

Fattu, Asso. Plot. Nicholas A. Ph D. (Minnesota) Mich. State ColL, E. Lansing, Mich. 

Federer, Walter M.S. (Kansas State) Res. Ag , Stat. Lab , Iowa State Coll., Ames, 
Iowa 



MEMBERS OP THE EVSTITUTE 


527 


Feldman, Hyman M. Ph.D. (Washington) Beaumont High School, St. Louis, Mo. 
Fellppe, Jose G. Pres,, Nat. Census Comm., Trav. Sia Terezinha 35, Rio de Janeiro, 
Brazil 

t*Feller, Prof. Will Ph.D (Gattingen) Cornell Univ., Ithaca, N Y., BU Highland Rd 
Feraud, Prof. Luclen Umv. of Geneva, 24, rue H. Mussard, Geneva, Switzerland 
Feriet, Kampe De Dr Sci (Paris) Profesaeur a la Faculte des Sci de PUniversite de 
Lille, 16 rue des Jardins, Lille, France 

Femandez-Bafios, Olegarlo D.C. (Madrid) Catedratico, Univ. of Madrid, Calle Lopez 
de Hoyos 7, Madrid, Spain 

Ferrell, Enoch B. M.A. (Oklahoma) Ees, Engr., Bell Tel Labs , 463 West St, N. Y. 
14, N. Y., 75 Fuller Ave,, Chatham, N J. 

Ferris, Charles D. A.B (Princeton) Qual Control Engr, General Elec Co, 1286 
Boston Ave , Bridgeport, Conn , 310 Woodstock Ave,, Stratford 
Fertlg, Prof. John W. Ph D. (Minnesota) Dept of Biostatistics, School of Pub Health 
of the Faculty of Medicine, Columbia Untv , 600 W 168th St, N. Y , N Y ,41 South- 
lavm Ave , Dobhs Ferry 

Field, Asso. Prof. Eobert W. PhD. (Illinois) Dept of Ind Engr, Purdue Umv., 
Lafayette, Ind , 9S0 Rose St,, W Lafayette, Ind 
File, Quentin W. Ph D (Purdue) Labor Utilization Analyst, Wright Aeronautical 
Corp., Cincinnati, Ohio, 35 Gahl Terrace, Reading IB (Last address) 

Fine, Clarence B. B.S S. (C.C.N Y ) Economist, OPA, Wash , D C , 1 S88 Tucherman 
St., N.W., Wash 11 

Fischer, Asso. Prof. Carl H. Ph D (Iowa) Dept of Math , Umv. of Mich , Ann Arbor, 
Mich , 1106 Morton Ave. 

t*Flsher, Prof. Irving PhD (Yale) Prof Emeritus, Yale Umv., Box 1825, New Haven 
8, Conn., 113 Park Ave , Hamden 

Fix, Evelyn MA (Minnesota) Lecturer in Math, and Res Asst., Stat Lab., Univ. of 
Calif, Berkeley, Calif 

Flaherty, Asst. Prof. William C. A.B (Georgetown) Georgetown Univ , Wash , D C. 
Flood, Merrill M. Ph D. (Princeton) Owner, Merrill Flood and Assoc , 20 Nassau St, 
Princeton, N J , 3806 Kanawha St, NW , Wash IS, D C. 

Foster, Prof. Ronald M. SB (Harvard) Dept of Math , Polytech Inst, of Brooklyn, 
85 Livingston St., Brooklyn 2, N Y , 1S6 Slanmore PI , Westfield, N. J 
Fox, Asso. Prof. Phillip G. A. M (Wisconsin) 403 Sterling Hall, Univ of Wia., Madison, 
Wis 

Frank, David H. B.S (CONY.) Admin Asst, Long Island City High School, 28-01 
41st Ave , Long Island City 1, N Y , ill W. 114th St, N Y SB 
Frankel, Lester R. M A (Columbia) Stat, Dun and Bradstreet, Inc , 290 Broadway, 
N Y. 8. N Y 

Franzen, Raymond Ph D (Columbia) Stat Consultant, 10 Rockefeller Plaza, N Y 
20 N Y 

Fraser, Andrew M.A (George Washington) Independent Consultant, 6441 Wiscosset 
Rd , Glen Echo Hts., Md 

Freeman, Albert M. Dir Math Lab , Boston Fiduciary and Res Assoc , 60 Congress St, 
Boston, Mass , 8 Parkside Rd., Providence S, B. I 
Freeman, Asso. Prof. Harold A. S B. (Mass Inst Tech.) Dept of Stat, Mass Inst. 

of Tech,, Cambridge, Mass., BOB Pleasant St, Belmont 78 
Freeman, Richard B.Sc. (McMaster) Res. Chemist, 1 Maple Ave., Hamilton, Ont, 
Can. 

Freund, Asst. Prof. John E. M A (U.C L.A ) Dept of Math , Alfred Univ , Alfred, 
N Y., Box 183 

Friedman, Asst. Prof. Bernard PhD. (Mass Inst. Tech ) New York Umv., 53 Wash¬ 
ington Sq. S , N Y., N Y., 37-41 81st St., Jackson Heights 



528 


MEMBERS QF THil INSTITUTE 


*Frle(iman, Asso, Prof. Milton Ph.D. (Columbia) Dept, of Boon., Univ. of Chicago, 
Chicago 37, Ill., 67gB S. Kenwood Ave. 

Froellch, Kathryn B.A. (Rvansville) Stat, U. S. Dept, of Agrio., Bur. of Human Nu¬ 
trition and Home Econ., Beltsville Res. Center, Md., B Lee Ave., Takoma ParklS 

*Pry, Thornton C. Ph.D. (Wiaconain) Dir. of Switching Rob , Bell Tel. I.a.ba., Inc., 403 
West St., N Y. 14, N. Y. 

Fryer, Prof. Holly C. PhD (Iowa State) Stat., Agrio. Exp. Sta., Kans State Coll., 
Manhattan, Kana„ /430 Legore Lane 

Gage, Robert P. M.S. (Iowa State) Asso. Med. Stat., Mayo Chnic, Rochester, Minn. 

Gause, G. Rupert B S. (The Citadel) Tech. Staff, Bell Tel. Ubs., 463 West St., N. Y. 
14, N Y., IBS mh Ave., Sea CM 

Gauthier, Prof. Abel AM. (Columbia) Universite do Montreal, 2900 Mount Royal 
Blvd , Montreal, Quo , Can 

♦Gelrlnger, Prof. Hilda P. Ph.D. (Vienna) Head, Math Dept., Wheaton Coll., Norton, 
Mass. 

Germond, Hallett H. PhD. (Wisconsin) Umv. of Fla,, Gainesville, Fla. 

Gersten, Lydia B, B.A (Hunter) Res. Stat,, 100 Lincoln PI., Brooklyn 13, N. Y. 

Ghormley, Glen E. B S. (Te.xas) Coordinator to Works Mgr., Lockheed Aircraft Corp., 
Burbank, Calif,, 1S9 N. Chester Ave., Pasadena 4 

GUI, Asst. Prof. John P. M A. (Alabama) Stat., Bur of Business Res., Umv. of Ala., 
University, Ala 

Glntzler, Leone B. M.A (California) SSSS gist Ave., San Francisco 16, Calif. 

*Glrshlck, Meyer A. Ph.D. (Columbia) Prin Stat, U. S. Dept, of Agrio., Wash. 25, 
D, C., mo ISth SI , N.P., Wash. 16 

Godfrey, Asst. Prof. Edwin L, A,M. (Indiana) Dept of Math., Defiance Coll., Defiance, 
Ohio, 16S Session SI 

Goffmon, Asst. Prof. Casper Ph.D, (Ohio State) Math. Dept., Univ. of Ky., Lexington, 
Ky., 1144 Shadycreal Dr., Pittsburgh Iff, Pa. 

Goldrosen, David B.S (Worcester Polytcch. Inst,) Lt., USNR, Qual. Control Officer, 
Insp. of Naval Mat’l, S 04 Ward St., Neiulon Centre, Mass. 

Goldstlne, Herman H. PhD. (Chicago) Asst. Project Dir., Elootronic Computer 
Project, Institute for Advanced Study, Princeton, N J., 4 OS 8 Walnut St., Phila¬ 
delphia 4, Da 

Golub, Abraham B.A. (Brooklyn) Math., Ballistics Res. Lab., Aberdeen Proving 
Gd , Md., Men's Dorm 

Gomberg, William Ph.D. (Columbia) Dir of Mgt , Engr. Dept,, International Ladies 
Garment Workers Union, 1710 Broadway, N Y , N. Y , 444 Beach 14Snd St, Neponsit, 
L. I. 

Goode, Harry H. AM (Columbia) Math., Special Devices Div., Office of Res. and 
Inventions, USN, Sands Point, L, I., N Y., SSBl Cambridge Ave., N. Y. 63 

Goodman, Albert A. Supv , Stat Qual. Control, Westinghouse Elec. Corp., Esslngton, 
Pa., 1S9 Green Valley Rd,, Upper Darby 

Gordon, Chester H. S.M. (Mass, Inst. Tech.) Staff Member, Div. of Ind. Coop., Mass 
Inst. Tech., Cambridge, Mass , S8 Cnatofaro St., Wakefield 

Gordon, Donald A. A. M (Columbia) Asst., Columbia Univ., N. Y., N. Y , c/o Prof. 
C. J. Warden, Psychology Dept. 

Gordon, John J. Sr. Engr., Qual Control, Western Elec. Co., Ino., Engr. Div., 7220, 
100 Central Ave., Kearny, N. J., 48S Ridge Rd., Apt. BQ, N Arlington 

Gordon, Robert D, A.M (Stanford) Teaching Asst., Math. Dept., Indiana Univ., 
Bloomington, Ind. 

Gottfried, Bert A. M.A (Columbia) Res. Analyst, Marketing and Res, Service, Dun and 
Bradstreet, 290 Broadway, N, Y., N. Y., 60SB Boulevard East, W. New York, N. J, 

Gottfried, Dorothy K. A.B, (Hunter) Stat., Colgate-Palmolive-Peet Co , Jersey City, 
N. J., 60SB Boulevard East, W. New York, N, J 



MEMBEHS OF THE INSTITUTE 


529 


Gough, Elsie L. A.M (Michigan) Asst, to Correspondent on Group Annuities, Metro¬ 
politan Life Ins, Co , 600 Stockton St, San Francisco, Calif, QIS Boulevard Way, 
Oakland 10 

Grant, Asso. Prof. David A. Ph.D (Stanford) Dept, of Psychology, Univ. of Wis , 
Madison 6, Wis., SSIO Kendall Are,, Madison S 
Grant, Prof. Eugene L. A.M. (Columbia) Dept of Civil Engr., Stanford Umv., Calif. 
Graves, Clyde H. Ph D. (Chicago) Operations Branch Chief, Office of Price Bd Mgt., 
OPA, Wash., D C,, 700 N. Wayne St., Arlington, Va 
Green, Asso. Prof. Earl L. Ph.D (Brown) Dept, of Zwlogy, Ohio State Univ , Columbus 
10, Ohio 

Greene, Kenneth E. B S. (Yale) 47S4 Post Road, Pelham Manor SS, K. Y 
Greenhouse, Samuel B S. (C.C.N.Y ) T/4 U. S. Army, 5818 13th St, N.W., Wash. 11, 
D. C. 

Greenleaf, Prof. Herrick E. H. Ph.D (Indiana) DePauw Univ., Greencastle, Ind., 
I0S4 B. College Are. 

Greenwood, Joseph A. PhD. (Missouri) Stat, Bur Aeronautics, Navy Dept., 1W66, 
Wash 26, D. C. 

Grelder, C. Edwin, Jr. B.A (Michigan) Actuarial Accountant, General Elec Co , 1 
Eiver E.d., Schenectady 6, N. Y., 7S0 Union Si 
Gretton, Owen B.A. (Brown) Acting Chief, Ind Div. Sen. Econ , 101S7 Old Bladensburg 
Rd., Silver Spring, Md. 

Grevllle, Thomas N. E. Ph D. (Michigan) Actuarial Math , Bur, of the Census, Wash. 
26, D. C., 1714 S4lh Si., N W., Wash 7 

Griffin, John I. Ph D (Columbia) Inatr , Econ. Dept., Coll of the City of N Y., 17 
Lexington Ave , N Y , N Y ,115 Henry St, Brooklyn $ 

Grlffltts, Prof. C. H. Ph.D. (Michigan) Univ of Mich., Ann Arbor, Mich., 7507 CAorllon 
Are. 

Groth, Alton O. M.S. (Iowa) Asst. Actuary, Equitable Life Ins. Co. of Iowa, Des Moines 
6, Iowa, S909 Waveland Dr., Des Moines 11 

fGrove, Asst. Prof. Charles C. PhD (Johns Hopkins) Coll, of the City of N Y., N. Y., 
14 s Milburn Are., Baldwin, N Y 

Groves, William B. B S. (Antioch) Econ , OPA, 620 Decatur St, N W., Wash., D. C. 
Grubbs, Frank E. MA. (Michigan) Chief, Surveillance Branch, Ballistic Bos. Lab,, 
Aberdeen Proving Gd , Md , 355 Wilson St , Havre de Qrace 
Guard, Harris T. MS. (Colorado) Instr , Dept of Math , Colorado A 4: M, Fort Collins, 
Colo., 221 S. Grant Are. 

Guilford, Prof. Joy P. Ph.D (Cornell) Dept, of Psychology, Umv. of S. Calif., Los 
Angeles 7, Calif , 509 N Bexford Dr , Beverly Hills 
Gulllksen, Prof. Harold Ph.D (Chicago) Prof., Dept, of Psychology, and Bes. Secy., 
Coll. Entrance Exam. Board, Princeton Umv., Princeton, N. J , /F Aiken Are. 
*Gumbel, Emil J. Ph.D (Munich) Spec Lecturer in Stat., Newark Coll, of Engineering, 
Newark, N. J , SS20 Waldo Ave , AT Y. 63, N. Y. 

Gunlogson, Lee S. B.B.A. (Minnesota) Stat Dept, Lumbermens Mutual Casualty 
Co , Mutual Ins Bldg., Chicago 40, Ill, 710 Lake Shore Dr., Chicago 11 
Gurland, John M.A (Toronto) Teaching Asst., Umv. of Calif , Berkeley, Calif., Inter¬ 
national House, Berkeley 4 

■fGumey, Margaret PhD (Brown) Stat., Bur of the Census, Wash. 26, D. C., 6102 
Lombard Si., Cheverly, Hyailsville, Md. 

t*Guttman, Asso. Prof. Louie PhD (Minnesota) Dept of Sociology, Cornell Univ., 
Ithaca, N. Y., SIS Dryden Rd. 

Gutzman, Asst. Prof. Wayne W. Ph.D (Iowa) Postgrad. School, Naval Academy, 
Annapolis, Md., Route 4tl 

♦Haavelmo, Trygve M. Cand Oecon. (Oslo) Dept, of Econ., 401 Social Science Bldg., 
Univ. of Chicago, Chicago 37, Ill. 



530 


MEMBERS OF THK INSTITUTE 


Hadley, Clausln D. Ph.D. (WiBCOoain) Stat, Eli Lilly & Go , 740 S Alabama St., In¬ 
dianapolis 0, Ind. 

Haeood, Margaret J. PhD. (North Carolina) Prin. Social Scientist, Bur. of Agric 
Econ., U. S Dept, of Agric., Wash. 25, D. 0., lliS Virginia Ave., S W., Wash 4 

Haines, Harold M.S. (New York) Prod Control Mgr., Burndy Engineering Co , 107 
Bruckner Blvd., N Y., N Y , 6B-6S Booth Si,, Forest Hills 

Halbert, Keet W. A.M. (Harvard) Stat., Amor. Tel. tk Tcl Co , 195 Broadway, N Y., 
7, N Y., 10 Rich Ave., Ml. Peiaoa 

Hall, Asso. Prof. Marguerite F. Ph.D. (Michigan) School of Public Health, Umv. of 
Mich., Ann Arbor, Mich., ZB Ridgeway 

Halmos, Asst. Prof. Paul R. Ph.D (IllinoiB) Dept of Math , Umv of Chicago, Chicago, 
Ill 

Hatton, Frederick J., Jr. Asst, to Pres., John Deere <t Co , 230 S. Clark St, Chicago, Ill., 
1314 Wesiview Rd., Highland Park 

Hamilton, Prof. Thomas R. Ph.D, (Columbia) A. & M. Coll, of Texas, Box 204 F.E., 
Coll. Station, Texas, Edge Apis 13, Bryan 

Hammer, Hans-Karl Ph.D, (Munich) Pillsbury Mills Inc., 21 West St., N. Y. 6, N. Y., 
1361 Plimpton Ave., N. Y B2 

Hammer, Asst. Prof. Preston C. Ph.D. (Ohio State) Dept of Math , Oregon State 
Coll., Corvallis, Oregon 

Hammond, Edward C. Sc.D. (Johns Hopkins) Major, AC, IJHAAF, Chief, Statistics of 
Flying Personnel Branch, OfRco of Air Surgeon, 4700 Connecticut Ave., Wash., D. G. 

Hand, Howard J. B S. (Carnegie Inat. Tech.) Ilea. Engr., National Tube Co., Frick 
Bldg., Pittsburgh, Pa., JfitB Wallingford St., Piilaburgh IS 

*HaaEen, Morrla H. M.A, (American) Stat. Asst, to the Dir,, Bur. of Census, Wash., 
D. C,, SIS Goddard Rd , Belhosda, Md. 

Hanson, Robert H. M S. (Iowa) Stat., Bur of the Gonsus, Wash , D. C., 3143 Westover 
Dr,, Wash. SO 

Hardy, Philip H. (Rice) Quality Engr., General Elec. Co , on leave, Cpl,, U. S, Army, 
4000 BU Sq. S,, Wright Field, Dayton, Ohio 

Harold, Miriam S. B.A (Hunter) Tech. Asst, Bell Tel. Labs., Murray Hill, N. J., 19 
Hillside Ave , Chatham 

Harris, Theodore E. M.A. (Princeton) 107 N. Field Si., Dallas, Texas 

Harrison, Joseph O., Jr. B.S. (George Washington) Math., Computation Project, 
Cruft Lab., Harvard Univ., Cambridge, Mass., 18 Forest St., Apt 4 

Harshbarger, Prof. Boyd Ph.D. (George Washington) Dept, of Stat., Va. Agric. Exp 
Sta , Va. Polytech. Inst., Blacksburg, Va. 

Hart, Alex L. Ph.D. (Minnesota) Dir. of Res. and PJarining Dept , Eastern Air Lines 
Ino , 10 Rockefeller Plaza, N, Y., N. Y., I 4 I-S 4 19th Ave , Flushing, L I, 

Hart, Bertha I. AM (Cornell) MAth., l^llistic Res. Labs , Aberdeen Proving Gd. 
Md. C3-^ Grant Ave., Aberdeen 

Hart, Prof. William L. Ph.D. (Chicago) Univ. of Minn., Minneapolis, Minn. 

Haskins, Asso. Prof. Elmer E. Ph.D. (Boston) Northeastern Univ., Boston, Mass. 
BS Damien Rd., Wellesley Hills 8S 

Hastay, MUlard A.B, (Reed Coll.) Res. Asso., National Bur. of Econ. Res., 1819 Broad 
way, N y , N. Y,, BOl W. ISlsl St, N. Y. SI 

Hasty, Willis L., Jr. B.C S. (Benjamin Franklin) Capt, Signal Corps, S47S S. Wake 
field St., Arlington, Va 

Hatke, Sister M. Agnes M.S. (Purdue Coll.) St. Francis Convent, 2701 Spring St. 
Ft. Wayne 8, Ind. 

Hayden, Byron R. B.A. (George Washington) Stat., Hqs. Army Air Forces, Wash. 25 
D. C., 1301 S. Cleveland St., Arlington, Va. 

Head, George A. B.E.E (New York) Tech. Staff, Bell. Telephone Labs., Inc., Murra; 
Hill, N J , / New England Ave., Summit 



MEMBERS OF THE INSTITUTE 


531 


Hebley, Henry F. Dir of Eea., Pittsburgh Goal Co , Oliver Bldg., Pittsburgh, Pa, (Last 
address) 

Hecht, Bernard B.E.E. (C.C.N.Y.) Mgr., Qual Control Dept., International Resist¬ 
ance Co., 401 N. Broad St., Philadelphia, Pa , SHIS-I McMichael St,, PhxUdel'phia 
Helde, John D. M.S. (Iowa) Gen. Labs., U. 8 Rubber Co., Market and South Sts., 
Passaic, N. J 

Headricks, Walter A. A.M. (George Washington) Prin. Agnc. Stat, Bur. Agrio Econ., 
Dept, of Agrio., Wash , D. C., 8901 Arlington Rd., Bethesda, Md. 

Henry, Malcolm H. M.S. (Michigan) Cost Analyst, Fisher Body Co., Fisher Bldg., 
Detroit 2, Mich., HB9J^ Lauder, Detroit 

Hess, Ida I. A.B (Indiana) Stat., Population Div., Bur of Census, Wash,, D. C., 
1J)SB Rhode Island Ave , N W 

HUdebrandt, Asso. Prof. Emanuel H. C. Ph D (Michigan) Dept of Math., North¬ 
western Univ., Evanston, Ill, 901 Golf Terraee, Wilmette 
Hlntermaler, John C. Chief Chemist, Vanity Fair Mills, Inc., Reading, Pa 
Hlrsch, Warren M. B A. (C.CN Y ) Tchr., Board of Educ , N. Y , N Y., 9791 Uni¬ 
versity Ave , Bronx 

Hizon, Manuel O. M A (Michigan) Grad Student, Math Dept , Univ of Mich, 
Ann Arbor, Mich , 19 Srd St., Balintawah, Rizal, Philippines 
Hodges, Joseph L., Jr, A.B. (California) Teaching Asst m Math., Univ. of Calif., 

. Berkeley 4, Calif., 40 Oakridge Rd., Berkeley B 
Hodgklnson, William, Jr. A.B. (Harvard) Stat, Amer Tel & Tel Co , 196 Broadway, 
N. Y. 7,N.Y. 

*Hoel, Asso. Prof. Paul G. Ph.D (Minnesota) Dept of Math , Univ. of Calif , Los 
Angeles 24, Calif , 113S7 Islela St 

Hoffer, Prof. Irwin S. M.B.A. (Harvard) Dept of Stat., School of Bus Adm , Temple 
Umv j Philadelphia 22, Pa , Willow Ave , Ambler 
Homseth, Richard A. M.A (Wisoonsm) Instr , Dept, of Sociology and Anthropology, 
Umv. of WiB , Madison 6, Wis. 

♦Horst, A. Paul Ph D. (Chicago) Proctor and Gamble, 6th and Mam Sts,, Cmoinnati, 
Ohio 

Hoskins, Robert H. A B. (Harvard) Actuarial Clerk, John Hancock Mutual Life Ins. 

Co , 197 Clarendon St , Boston 17, Mass , SO St Botolph St , Boston 16 
t*Hotelllng, Prof. Harold Ph,D, (Princeton) Inst, of Stat, Umv of N. 0,, Chapel Hill, 
N C. 

Householder, Alston S. Ph.D (Chicago) Math Consultant, Psychology Sec., Missile 
Control Div , Naval Res Lab , Wash 20, D. C , 4&16 Madison St., Hyattsville, Md. 
Houseman, Earl E. M A. (S. Dakota) Stat, Bur. of Agnc. Econ , Wash , D C., 986 
N. Kentucky St, Arlington, Ya. 

Howell, John M. B A. (U C L A ) Stat. Analyst, Qual. Control, Northrop Aircraft, 
Inc , Hawthorne, Calif , 4^40 W BSrd St, Los Angeles 4S 
Hoy, Elvln A. B S (Oregon State) Chief Stat. Sec., Fed Security Agency, Social Se¬ 
curity Adm , Bur Res. & Stat , Div Health and Disability Studies, 1825 H St , N.W , 
Wash. 25, D C , 9800 Erie Si., S E., Wash. 90 
*Hsu, Asso. Prof. Pao-Lu D.Sc (London) Inst of Stat., Univ. of N. C , Chapel Hill, 
N C 

Hughes, Harry M. M.A. (Texas) Teaching Asst., Univ of Calif, Berkeley, Calif., 
I 4 B 4 Bancroft Way, Berkeley 9 

Humes, Helen M. M A. (Pittsburgh) Price Econ., Bur of Labor Stat,, Dept of Labor, 
Wash , D C , S70S 34 th St, N W , Wash 8 

Humm, Doncaster G. Ph D (S California) Co-owner and Dir., Humm Personnel Serv¬ 
ice, 1219 W. 12th St , P O. Box 1433, Del Valle Sta., Los Angeles, Calif., 900 S. Windsor 
Blvd , Los Angeles 6 

♦Huntington, Prof. Edward V, Ph.D. (Strassburg) Prof Emeritus, Dept, of Math., 
Harvard Umv., Cambridge 38, Mass., 4® Highland St. 



532 MKMBKIIS OF THK INSTITUTE 

Hurwicz, Asso. Prof, Leonid Ij.L.W. (Warsaw) Iowa State Coll , Ames, Iowa, 0*1 
Fifth St 

*Hiirwltz, WllUoin 119 Concord Ave , Wash , I) C 

♦Ingraham, Dean Mark H. Ph D (Chirogo) Prof, of Math., ITmv, of Wis , Madison, 
Wis. 

Jahlon, Seymour A M. (Columbia) 200 W. 108th St,, N. Y., N. Y. 

♦♦Jackson, Prof. Dunham Ph.I). (Giittingon) Univ. of Mmn , 119 Folwell Hall, Minne¬ 
apolis, Minn, 

Jackson, Irwin E. M A.Mech.Eng (Pennsylvania) 193$ ISth St,, M.W., Apt. X, Wash., 
D. G. 

Jackson, Asst. Prof. Robert W. B. Ph.D. (Ixmdon) Dopt. of Educ. Res., Univ. of To¬ 
ronto, Toronto, Out.I Can., 194 Cranbrookc Ave. 

Jacob, Asst. Prof. Walter C. Ph D (Cornoll) Cornell Univ., Long Island Veg, Res. 
Farm, Riverhead, N Y 

Jacobs, Walter W, A.M (George Washington) Res. Analyst, Army Sec. Agency, Arling¬ 
ton, Va , 4704 N. SOlh Rd. 

Jacobson, Jack J. M.B A. (Chicago) Stat., Spiegel, Inc,, Chicago, III., SB6X W. Palmer 
Si 

Jahn, Fredrlc S. M.S (Florida) Pres , New Plastic Corp., 1017 N. Sycamore Ave., 
Hollywood, Calif,, 8 SS 4 De Longpre, Lo» Angeles 
James, R. W. M A. (Toronto) Dominion Bur of Statistics, Ottawa, Ont., Can. 

Janko, Prof. Jaroslav Technical Univ., Prague, Czoohoslovakia, Na bogiati S, Praha II 
Jarmlllo, Trinidad J. Ph.D (Cliioago) Rea. Math., Armour Rea. Foundation, 35 W. 

33rd St,, Chicago 10, Ill., 1947 S. Kedzte Ave , Chicago ®S 
Jarrett, Rheem ¥. A.B (Arizona) Lecturer, Dept, of Psychology, Univ of Calif., 
Berkeley 4, Calif., 1800 San Antonio Ave., Berkeley 7 
Jemlng, Joseph B, M.S (Columbia) Financial and Economic Consultant, 220 W. 42nd 
St, N. Y,, N. Y , /* Bronson Ave., Scaisdale 
Johner, Paul DS (Carnegie Inst, of Tech ) Ind Engr, Div,, Aluminum Co. of America, 
Now Kensington, Pa., 8SS Carl Ave. 

Johnsen, Madeline Ph.D. (Stanford) Instr., Dept, of Math., Purdue Umv,, Lafayette, 
Ind. 

Johnson, Asso. Prof. Evan, Jr. Ph.D. (Chicago) Pa. State Coll., State College, Pa., 
S4S S. Buckhoul St. 

Johnson, Prof. Palmer O. Ph.D, (Minnesota) Univ. of Minn , Minneapolis, Minn., 
SSIS Bdmund Ave 

Jones, Howard L. A.B, (Illinois) Supv. of Results, Ill. Bell Tel Co , 309 W Washington 
St,, Chicago 6, Ill., S8S0 N Claremont Ave., Chicago 18 
Jones, R. Richard, Jr. A B (Columbia) 81 Jackson St., New Rochelle, N. Y. 

Jones, Warren E. B.A. (Maryville) Pres, and Owner, Management Controls, 699 Rose 
Ave., Des Plaines, Ill. 

Juran, Joseph .I.D. (Loyola) Chm., Dept, of Adm. Engr , New York Univ., N Y. 53, 
N Y , 196 Beech St., Bronxville Manor, Tuckahoe 
♦Kac, Asst. Prof. Mark Ph.D. (Lwow) Dept, of Math,, Cornoll Univ., Ithaca, N, Y,, 
110 Eddy St. 

Kaltz, Alice S. M.A. (Columbia) Stat., Bur. of the Census, Wash. 26, D. C,, 8 IS 4 First 
Place, N.E., Wash. 11 

Kaltz, Hyman B. BA (George Washington) Stat, Social Security Adm., Social Security 
Bldg , Wash., D. C , 1008 Maasachusells Ave , N.W., F.C.-S, Wosh 1 
Kallnowskl, Walbert C. St John’s Univ , Collegeville, Minn,, 3689 West Pine Blvd., 
Si. Louis a, Mo 

Kampschaefer, Margaret A.B, (Indiana) Stat,, War Dept., HQ IX Air Force Serv. 
Command, Supply Div., APO 66, N Y., N. Y,, 1037 E. Blackford Ave., Evansville, 
Ind. 


♦* Deceased. 



MEMBERS OF THE INSTITUTE 


533 


Kaplansky, Irring Ph.D. (Harvard) Instr . Univ. of OHcago, Chicago, HI. 

Karp, Abraham E. M.S, (C C.N.Y ) Stat., Aberdeen Proving Gd , Md,, 55 Aberdeen 
Ave 

Katz, Amrom H. B.A. (Wisconsin) SO B. Wren Circle, Dayton 10, Oho 
Katz, Asst. Prof. Leo Ph.D. (Michigan) Mich, State Coll., p]ast Lansing, Mich., 18474 
Washburn, Detroit 21 

Kavanagh, Arthur J. Ph.B. (Yale) Asso. Physicist, American Optical Co , Scientific 
Instrument Div , Box A, Buffalo 16, N. Y., 180 Commonwealth Ave., Buffalo 18 
Keefe, David P. B.S (St. Thomas) Supv., Material Testing, Minn. Mimng and 
Mfg Co,, St. Paul 6, Minn,, S90 Holly Ave., Si. Paul 
Keeney, Roger D. A.B. (Bucknell) Math. Clerk, Metropolitan Life Ins. Co., 1 Madison 
Ave , N. Y., N, Y , 110 Fourmer Crescent, B Patterson, N. J. 

Keeping, Asso. Prof. Ernest S. D.I C (London) Univ. of Alberta, Edmonton, Alberta, 
Can,, 11124 28 Ave. 

fKeffer, Ralph M.A. (Wisconsin) Actuary, Aetna Life Ins Co , Hartford 15, Conn., 
42 Four Mile Rd., W. Harford 7 

Kefferstan, William Mgr., Eeon. Res. Dept, Boston Fiduciary & Res Assoc , 60 Con¬ 
gress St , Boston, Mass. 

Kelslar, Evan R. Ph D. (California) Instr , Princeton Univ., and Rea. Asso , College 
Entrance Exam Board, Princeton, N. J., Nassau Club 
t*Kelley, Prof. Tnunan L. Ph.D. (Columbia) Harvard Univ., Walker House, 40 Quincy 
St., Cambridge 38, Mass. 

Kellogg, Lester S. M A. (Northwestern) Chief, Prices and Cost of Living Branch, 
Bur, of Labor Stat , U. S Dept of Labor, Wash., D 0,404 Shady Lane, Falls Church, 
Va 

*Kendall, Maurice G. M A. (Cambridge) Asst. Gen. Mgr., Chamber of Shipping of the 
United Kingdom, Bury Court, St Mary Ave , London, E. C. 3, Eng , B07 Hood House, 
Dolphin Square, London, S W. 1 

Kennedy, Evelyn M. M.A. (Cincinnati) Math,, Applied Physics Lab , Johns Hopkins 
Umv., Silver Spring, Md , S91118ih St., N W , Wash. 11, D C 
Kenney, Asst. Prof. John F, A M, (Michigan) Univ. of Wis, at Milwaukee, Wis., 
62S W State St 

Kent, Robert H. M A. (Harvard) Chief, Exterior Ballistic Lab , and Asso, Dir of Ballis¬ 
tic Res. Labs , Aberdeen Proving Gd., Md , SOO S. Union Ave., Havre de Grace 
Keppler, Wharton F. B A. (Ohio State) Stat., M. & R Dietetic Labs , Inc , 8 E Long 
St. I Columbus 16, Ohio, 11 Nottingham Rd , Columbus 2 
Keyfitz, Hathan B.Sc. (McGill) Stat., Dominion Bur. of Stat, Ottawa, Out., Can , 

5 Bristol Ave 

Klchline, Asst. Prof. William. L. MS (Lehigh) Univ of New Hampshire, Durham, 

N H 

Kimball, Bradford F. PhD. (Cornell) Sr Stat., State Public Service Comm , 233 Broad¬ 
way, N Y. 7, N. Y., 33 Bogart Ave., Port Washington 
Klndig, Fred E. B S (Pennsylvania State) Industrial Math., Westinghouse Elec, Co., 
Braddock Ave., E. Pittsburgh, Pa , 53 Nantucket Dr., B D 6, Pleasant Hills, Pitts¬ 
burgh 10 

King, Arnold J. B.S. (Wyoming) Sr. Stat, U S, Dept of Agrio, and Iowa State Coll., 
Stat. Lab., Ames, Iowa 

King, Frederick G. A B. (Harvard) Instr,, Anti-Aircraft Artillery School, Fort Bliss, 
Texas, 1312 Montana, El Paso 

Kingston, Prof. Jorge C.B. (Brazil) 23 Rua Rita Ludolf, Rio de Janeiro, Brazil 
Klnsler, David M. M A (Chicago) Chief, Analytical Sec., Arms and Ammunition 
Div., Ord. Res and Devel. Center, Aberdeen Proving Gd , Md. 

Kirchen, Calvin J. M.A. (Wisconsin) Remington Arms Co , Bridgeport, Conn,, 107 
Wood Ave , Stratford 



534 


MEMBERS OF THE INSTITUTE 


Klauber, Laurence M. A.B (Stanford) Vice Prea. and Gen, Mgr., San Diego Gas <fc 
Elec Co , San Diego 12, Calif., SS3 W. Jumper St., San Diego 1 
Klein, Lawrence R. PhD. (Mass Inst Tech.) The Cowles Gornra.., TJniv. of Chicago, 
ChicagC 37, III 

Knoepfel, Margaret F. A.B. (Brooklyn) Asst. Stat, Weather Bur., $SB 96lh St., Brook¬ 
lyn 9, N. Y., until June 1, 1947; then, 8306 Ely PI. S.E , Waeh. 19, D. C. 

Knowler, Asso. Prof. Lloyd A. Ph.D. (Iowa) Asso. Prof, and Chm., Dept of Math., 
State Univ of Iowa, Iowa City, Iowa, S Woolf Ave. Court 
Knudsen, Lila F. B.S (Minnesota) Stat., U. S Food and'Drug Adm , Wash. 26, D. C,, 
8901 Connecticut Ave., N. W., Wash 8 

Konljn, Hendrik S. M A. (Columbia) Res. Analyst, U. S. Dept, of State, Wash., D. G., 
1883 N. Monroe St., Arlington, Va. 

*Koopmans, Asso. Prof. Tjolllng Ph D. (Leiden) Dept, of Econ., Univ. of Chicago, 
Chicago 37, Ill. 

Kopp. Paul J. M.A. (Duke) Patent Dept., Gulf Oil Corp., Munsey Bldg., Wash , D.C., 
ISOS N Adams St., Arlington, Va 

Kosambl, D. D. Sc.B. (Harvard) Dept, of Math., Tata Inst, of Fundamental Res , 53 
Peddcr Rd., Bombay, India 

Kossack, Carl F. Ph.D (Michigan) Math., Joint Army-Navy Air Intelligence, The 
Pentagon, Wash., D. C., BSll Blair lid., N.E., Wash. 11, D C. 

KozaklewUz, Asst. Prof. Waclaw Ph.D. (Warsaw) Dept, of Math., Univ. of Saskatche¬ 
wan, Saskatoon, Saskatchewan, 908 Saskatchewan Crescent E. 

Kozelka, Dean Richard L. Ph.D. (Minnesota) School of Bus. Adm , Univ. of Minn., 
Minneapolis 14, Minn., 83 Bedford Si., S.E. 

Kramer, Morton So.D, (Johns Hopkins) Tuberculosis Stat. for Cuyahoga County, 
Tuberculosis Clinic, 200 High Ave., Cleveland 16, Ohio 
Kruskal, William M S. (Harvard) ISO W. SOlh St., N, Y., N. Y. 

Kubls, Asso. Prof. Joseph F. Ph.D. (Fordham) Dept, of Psychology, Fordham Univ. 

Grad. School, Bronx, N. Y., 813 Calyer St., Brooklyn 
*Kullback, Solomon Ph.D. (George Washington) Dept, of Stat., George Washington 
Univ., Wash., D. 0., 18S9 Van Buren St., N.W. 

Kury, Anita R. M.A. (Michigan) Community Pricing Sec., OPA, Wash., D. C , 1981 
Kalorama Bd., N W., Wash 9 

Kuznets, Asst. Prof. George M. Ph D. (California) Univ. of Calif,, Berkeley, Calif. 
Kwerel, Seymour M. B.S (C.C.N.Y.) International Stat. Bur., Inc , 350 Fifth Ave , 
N. Y 1, N Y,, El- Washington Ave , N. Y. 33 
Kyle, Garland D. M.A, M S. (Ohio State, Michigan) Physicist, USN, 911 E. 28nd 
St., Minneapolis, Minn. 

Lacey, Prof. Oliver L. Ph.D (Cornell) Head, Dept, of Psychology, Umv. of Ala., 
University, Ala. 

Ladd, Robert B. B.A. (Texas Coll, of Arts and Ind ) Chief Stat., Commercial Traffic 
Service, Transportation Corps, U. S War Dept., Rm. 4C-649, Pengaton Bldg., 
Wash., D. C., 90S Wade Ave., Rockville, Md. 

Laderman, Jack M.A. (Columbia) Stat. Analyst, Chemical Corps, War Dept., Ill 
E. 16th St, N. Y. 3, N. Y., 3830 Cruger Ave,, Bronx 67 
Laguardla, Prof. Rafael (Uruguay) Dir., Institute do Matematioa y Estadistica, Fa- 
oultad do Ingemeria, Cerrito 73, Montevideo, Uruguay 
Lancaster, Otis E. Ph.D. (Harvard) Prin Math, Bur. of Aeronautics, Navy Dept, 
Wash., D. C , 4607 87lh St., Ml. Rainier, Md 

Landau, Hyman G. Ph.D. (Pittsburgh) Math., Ballistic Res. Labs., Aberdeen Proving 
Gd., Md., 8S9 Wilson St., Havre de Grace 

Lange, Prof. Oscar LL.D. (Cracow) Polish Embassy, 2640 16th St., N.W , Wash. 9, 
D C. 



MEMBERS OP THE INSTITUTE 


535 


Langmiilr, Charles R. Ph.B. (Yale) Secy -Treaa. and Lab Dir , Bennett and Langmuir 
Development Corp., 126 Spencer PL, Mamaroneck, N Y., 49 Mulberry Rd., New 
Rochelle 

Larsen, Asso. Prof. Harold D. Ph D. (Wiaeonsin) Umv. of New Mexico, Albuquerque, 
N. M 

Larson, Charles M. B.So. (Nebraska) Math., Pacific Mutual Life Ins. Co., Los Angeles, 
Calif , 5144 W. ISSlh St., Hawthorne 

LaSala, Lucy A. M.A (Columbia) Math Teacher, East New York Voc. H.S , Brooklyn, 
N. Y , SS6 Irving Ave., Brooklyn 97 

Lavln, Marvin M. M S. (Chicago) Qual Control Stat, Corning Glass Works, Cormng, 
N. Y , 5349 Kenmore Ave., Chicago, III. 

Leavens, Dickson H. M A. (Yale) Rea Asso., Cowles Comm for Res, in Econ,, Univ. 
of Chicago 37, Ill , 1151 E. S6lh St 

Leepln, Peter Ph D. (Basle) Actuary, Basle Life Ins Co , Basle, Switzerland, Gellersir. 
59 

Lefever, Prof. D. Welty Ph.D. (S. California) Dept of Eduo , Umv of S. Calif , Univ. 
Pork, Los Angeles 7, Calif 

Lehmann, Erick L. PhD. (California) Instr., Univ of Calif., Berkeley 4, Calif., 40 
Oakridge Rd , Berkeley 5 

Lelbler, Richard A. Ph.D, (Blinois) Member, Inst, for Advanced Study, Princeton, 
N J., 34 Vandeventer Ave 

Leighton, Prof. Walter, Jr. Ph D. (Harvard) Dept of Math., Washington Univ , St. 
Louis 6, Mo 

Lelpnlk, Hoy B. B S. (Chicago) Rea Asst, Cowles Comm , Umv of Chicago, Chicago 
37, Ill., 5597 S Kenwood (Last address) 

LeLeiko, Max B S (New York) Instr , Rutgers Umv , New Brunswick, N. J 

Leone, Fred C. MS (Georgetown) Ensign, 832 E. 225th St., Bronx 66, N. Y., 310 N. 
Salisbury St, W. Lafayette, Ind. 

Lesansky, William A. B,B A (O.C.N Y.) Stat., War Dept , Wash, D. C , 5311 4lk 
St., N W., Wash 11 

Lessard, Prof. Roger B.A Sc. (Montreal) Hull Tech School, 109 Wright, Hull, Que., 
Can , 6914 Donormantnlle, Montreal 8 

Lesser, Grace L. B A. (Hunter) Stat, The Econometrics Inst., Inc., 600 Fifth Ave., 
N Y , N. Y , 1578 Unionport Rd., Bronx 69 

Levene, Howard B.A (New York) Instr. in Biometrics, Univ. Extension, Columbia 
Univ , N Y ,1:1. Y., 99 E 88th St, N. Y. 98 

Levin, Joseph H. Ph.D. (Chicago) Chief, Machines Sec , Computing Lab , Ballistic 
Res Labs., Aberdeen Proving Gd , Md 

Levine, Harriet A B (Hunter) Asst. Math Stat., Stat Res. Group, Columbia Univ., 
N Y., N. Y , 309 W. 99lh St., N. Y. 95 (Last address) 

Levine, Myra AM (Columbia) Stat, Socony Vacuum Oil Co., 26 Broadway, N. Y. 4, 
N. Y ,309 W 99lh St., N. Y 95 

Lew, Edward A. AM. (Columbia) Asst. Actuary, Metropolitan Life Ins, Co., 1 Madison 
Ave,, N. Y., N Y., 51 Mohegan Rd , Larchmont 

Lewis, E. Vernon Ph.D. (Maas Inst. Tech.) Jr. Res. Asso , E. I. du Pont de Nemours 
and Co., Exp. Sta , Wilmington, Del, 109 N Rd Inndamere, Wilmington 974 

Lewis, Wyatt H, B S (Calif. Inst. Tech.) Qual Control Engr., General Elec. Co., 
Ontario, Calif , 919 E. H St. 

Li, Jerome C. R. Ph D. (Iowa State) Instr , Oregon State Coll., Corvallis, Oregon, 
9305 Jackson St. 

Lieherman, Jacob E. B.S. (Brooklyn) Stat., Census Bur., Wash , D. C , 9499 14 th St., 
N.E. 

Lleblein, Julius M. A. (Brooklyn) Econ. Analyst, Div. Tax Rea., U. S. Treasury Dept., 
15th and Pa. Ave., Wash. 25, D C., 9710 99th St, S E , Wash. 90 



536 


MEMDKIlfl OP THE INSTITUTE 


Lien, Roy M S. (Oregon State) Rate Stat., Northwestern Elec. Co., Portland, Oregon, 
StSl S.E. Division Si , Portland S 

Likert, Rensls Ph.D (Columbia) Head of Div. of Program Surveys, B.A.E., Dept of 
Agric , Wash , D. C., 4SSS Drummond Ave , Chevy Chase IB, Md 

Lindsey, Fred D. M A. (George Washington) Stat., Econ. Res. Dept., U. S Chamber 
of Commerce, 1(515 H St., N.W , Wash., D. C., 4B1} Stanford St , Chevy Chase, Md. 

Llttftuer, Prof. Sebastian B. D Se. (Mass Inst. Tech,) Chm , Dept, of Math , Newark 
Coll, of Engineering, 367 High St., Newark 2, N J. 

Livers, Asso. Prof. Joe J. Ph.D, (Michigan) Montana State Coll., Bozoman, Montana, 
404 W. Arthur 

Locatelll, Humbert 44 Seaman Ave , Manhattan, N. 7. 

Lonsethi Asst. Prof. Arvld Ph D. (California) Math. Dept, Northwestern Univ, 
Evanston, Ill. 

Lopata, Simon M.A. (Brooklyn) Inatr in Econ, and Stat., Rutgers Umv., New Bruns¬ 
wick, N J , .(3 Hart St., Brooklyn 6, N. Y. 

Large, Prof. Irving Ph.D. (Columbia) Teachers Coll., Columbia Univ., 525 W 120th 
St, N. Y. 27, N Y., S90 Riverside Dr , N Y. 25 

*Lotka, Alfred J. D Sc. (Birmingham) Asst. Stat., Metropolitan Life Ins, Co , 1 Madison 
Ave., N. Y 10, N Y., Beattie Park, Red Bank, N. J. 

Lowry, Edward D. Stat., Western Cartridge Go,, B. Alton, Ill , 60SBlhSt 

Lukacs, Prof. Eugene Ph D. (Vienna) Our Lady of Cincinnati Coll., 2220 Victory 
Parkway, Cincinnati, Ohio, SSI McGregor Ave,, Cincinnati 19 

Lundberg, Prof. George A. Ph.D. (Minnesota) Walker Ames Head of Soe. Dept, 
Umv of Wash , Seattle, Wash. 

Lyons, Will B So. (Buoknell) Economist, Office of Econ. Review and Analysis, Civilian 
Prod Adm., Wash. 26, D. 0., ISBl Taylor St., N.W., Wash. 11 

MacNelsh, Harris F. Ph.D. (Chicago) Chm., Math. Dept, Brooklyn Cloll,, Bedford 
Ave. and Ave, H, Brooklyn, N. Y. 

Macphall, Prof. Moray St. J. Ph.D. (Oxford) Acadia Univ., Wolfvillc, Nova Scotia, 
Can. 

Maddrlll, James D. Ph.D. (California) Section Chief, Ballistic Res. Lab , Aberdeen 
Proving Gd., Md., 54 Liberty St., Aberdeen 

Madow, Lillian H, M.A, (American) 2264 Gresion Ave., N.Y. SS, N Y. 

*Madow, Asso. Prof. William G. Ph D (Columbia) Inst, of Stat., Univ. of N. C , Chapel 
Hill, N. C., 2264 Creston Ave., N. Y BS, N. Y. 

Maloney, Clifford J. M.A. (Minnesota) Instr., Math Dept., Iowa State Coll., Ames, 
Iowa 

Malzberg, Benjamin Ph.D. (Columbia) Dir., Bur. of Stat., New York State Dept, 
of Mental Hygiene, State Office Bldg., Albany, N Y., SS Bancker St. 

Mandel, John M S. (Brussels) Res. Chem., B. G. Corp., 136 W. 62nd St., N Y., N. Y. 

*Mann, Asso. Prof. Henry B. Ph D. (Vienna) Ohio State Univ., Columbus 10, Ohio, 
161 Arden Rd 

Mansfield, Ralph S. M. (Chicago) Res. Engr., Jos. Weidenhoff Inc., 8049 S. Maryland, 
Chicago 19, Ill. 

Manuele, Joseph Dir. of Qual Control, Westinghouso Elec. Corp , E. Pittsburgh, Pa., 
Box 984 

Marcuse, Sophie M.A. (Columbia) Asst. Homo Economist, Dept of Agric,, Wash., 
D. 0., 4609 Chevy Chase Blvd., Chevy Chase 16, Md. 

Marks, Ell S. Ph.D. (Columbia) Prin. Bus Economist, OPA, Wash. 26, D. C., 3711 
Horner PI,, S.E., Wash. 20 

Marxian, Dixon M. M.A. (Columbia) 1506 Shadyside Rd., Baltimore 18, Md. 

Marschak, Prof. Jacob PhD. (Heidelberg) Dept, of Econ., Umv, of Chicago, Chicago 
87, Ill , 1SS5 E. 52nd St., Chicago 15 



MEMBEKS OF THE INSTITUTE 


537 


Martin, Cyrus A. B S. (Iowa State) Stat, Hqs. Fifth Army, 7811 South Shore Dr., 
Chicago 49, III. 

Martin, Asst. Prof. Margaret P. Ph D. (Minnesota) Univ, of Minn , 118 Millard Hall, 
Minneapolis 14, Minn , 1365 Selby Ave , St Paul 4 
Martin, Prof. William T. Ph D (Illinois) Dept of Math , Mass Inst of Tech , Cam- 
bridge 39, Mass., 81 Kilburn Rd., Belmont 18 
Massey, Frank J., Jr. M.A (California) Asso in Math , Univ. of Calif, Berkeley, 
Calif., 1364 Union St., San Francuco 9 

Mathleus, George J. B L (Dayton) Inetr , Army Service School, Douglas Aircraft, 
Long Beach, Calif , 1S5S Poppy St., Long Beach B 
Mathlsen, Harold C. A B (Princeton) B9 Pemwood Rd , E. Orange, N J 
Mauchly, John W. Ph D. (Johns Hopkins) 319 St Marks St , Philadelphia 4, Pa 
Maxwell, Pat, Jr. B S (Fordham) Stat, Surveillance Br , Ballistic Res Lab , Aber¬ 
deen Proving Gd., Md., Apt. A6-S, Baldwin Manor, Aberdeen 
fMayer, George F. T. M A (Minnesota) Fiscal Acet, Minnesota State Dept of Educ , 
308 State Office Bldg , St Paul, Minn., 1S19 7th Si., S E., Minneapolis 
Maynard, Burton I. A.D (Stanford) Stat Analyst, Douglas Aircraft Co., Santa Monica, 
Calif , 918 S Mullen Ave , Los Angeles 6 

Mazza, Prof. Slgfrldo C. Dir , Instituto de Estadistica, Facultad de Ciencido Eoo-' 
nomicas, Tristan Narvaja 1474, Montevideo, Uruguay 
McBee, Ethelyne L. M.A (Columbia) Sci. and Math Teacher, Falls Church H S , 
Falls Church, Va , 3136 N. Stafford St., Arlington 
McCarthy, Michael D. Ph.D (Unit. Coll, Cork) Lecturer, Umv Coll, Cork, Ireland 
McCarthy, Philip J. M A, (Princeton) Social Science Res. Council, Fine Hall, Princeton 
Umv , Princeton, N J. 

McCormick, Prof. Thomas C. T. Ph.D. (Chicago) Dept of Sociology, Univ. of Wis , 
Madison, Wis , 4903 Wavesah Trail, Madison 6 
McEwen, Prof. George F. Ph D. (Stanford) Dept, of Physical Oceanography, Senpps 
Inst, of Oceanography, Univ of Calif , La Jolla, Calif., P 0. Box 109 
McGann, Asst. Prof. Paul W. A.B. (Brown) Dept of Econ , American Umv , Wash., 
D C , 330 N Piedmont, Apt. 4, Buckingham Community, Arlington, Va. 

McIntyre, Donald P. M.A (Toronto) Meteorological Office, Dept, of Transport, Mon¬ 
treal Airport, Dorval, Que., Can. 

McIntyre, Francis E. PhD (Chicago) Deputy Dir for Export Control, Commodities 
Branch, Office of International Trade, U S Dept, of Commerce, Wash 26, D C , 
B514 Norlhfield Rd , Selhesda 14, Md. 

McPherson, John C. B.S. (Princeton) Dir of Engineering, International Business 
Machines, 590 Madison Ave,, N Y 22, N. Y , Short Hills, N. J 
Merrell, Asso. Prof. Margaret Sc D. (Johns Hopkins) Dept of Biostatistics, Johns 
Hopkins School of Hygiene, 615 N Wolfe St, Baltimore 5, Md , 1808 Eulaw PL, 
Baltimore 11 

Michael, William B. M S (S. California) Lecturer, Umv of S Cahforma, Umv. Park, 
Los Angeles, Calif , 388 S. Oak Ave , Pasadena 10 
Mlchalup, Eric LL.D. (Vienna) Actuary, La Previsora, Aparlado 848, Caracas, Vene¬ 
zuela 

Miller, Robert C. Eos Engr,, Elgin National Watch Co,, Elgin, HI 
Millikan, Max F. Ph.D, (Yale) Res Asso , Yale Umv., New Haven, Conn , Perkins 
Rd , Woodbndge 

Mills, Vicente P.O Box 2090, Manila, Philippine Islands 

Miner, John R. Sc D (Johns Hopkins) Asso. Editor, Mayo dime, 102-110 2nd Ave., 

S W , Rochester, Minn., 619 1th Ave., S.W. 

*Mlses, Prof. Richard von Dr. (Vienna) Harvard Univ , Cambridge, Mass., 31 Concord 
Ave , Cambridge 38 



538 


MKMBKHH OF THK INSTITUTE 


Mlttra, Probodh C. M.A. (Cijlumbia) Statistician, United Nations, Lake Success, 
N Y., n E. 94 th SI., N. Y. 

Mode, Prof. Elmer B. A.M. (Harvard) Boston IJniv , 688 Boylston St,, Boston, Mass., 

9 Longmeadow Rd , Wellesley Si 

t*MoUna, Prof. Edward C. Newark Coll, of Engineering, 365-309 High St , Newark 2, 
N 3 , 141 Dodd Si , E. Orange 

Monro, Sutton B.S. (Mass. Inst, Tech.) Instr., Dept, of Math, and Astron., Univ. of 
Maine, Orono, Maine 

Montes, Jose G. Civ. E (Havana) Dir. of Industry, Ministry of Agric,, Avemda de 
Belgtca Havana, Cuba 

♦Mood, Abso. Prof. Alexander M. Ph.D. (Princeton) Stat. Lab , Iowa State Coll., Ames, 
Iowa, StS7 Counlry Club Bind. 

Moore, Margaret E. B,A (Wilson) Stat, Navy Dept., Naval Ord. Lab., U. S Naval 
Gun Factory, Wash. 26, D. C., SO 4 Lincoln Way W., Chambersburg, Pa 
Moore, Marjorie E. Ph.D, (Minnesota) Program Analyst, Pod. Security Agency, Office 
of Voc, Hehabilitation, 815 Connecticut Ave., N W,, Wash., D C., The Meridian 
mu, mt leih Si., N W., Wash. 9 

Morrison, Nathan A.B (Brooklyn) Pnn. Actuary, N Y State Div. of Placement and 
Unemployment Ins , 342 Madison Ave , N Y. 17, N. Y., 4!B0 Ave. F, Brooklyn 18 
Morrow, Asst. Prof. Dorothy J. MS (Washington) George Washington Umv , Wash. 
0, D G , BUS F Si., N.W., Wash. 7 

•fMorse, John W. M.A (Columbia) Chief Control Aid, Stat. Sec , U. S. Public Health, 
Bethesda Station, Betheada, Md., 19 Westwood Dr., Wash 16 
Morton, Prof, Joseph E. D.Sc. (Geneva) Warren Hall, Cornell Umv., Ithaca, N. Y,, 
and Consultant, Nat’l Bur. of Ecoii. Res, 254th St. and Indopendencc Ave , N. Y. 
63, N, Y., SS7(h Si. and Palisade Ave., N. Y. 6S 
Moslmann, Thomas F. A B. (Charleston) Regional Employment Analyst, U. S. Bur. 

Labor Statistics, Dallas, Texas, 4S16 Western Ave , Dallas 11 
Moss, Judith B.A. (Vassar) Res. Asst, Nat’} Bur. of Econ. Res., 1819 Broadway, N 
Y., N. Y , S19 SI Johns PL, Brooklyn 17 

Mosteller, Frederick PhD, (Princeton) Lecturer and Res. Asso., Dept, of Social Rela¬ 
tions, Emerson Hall, Harvard Univ , Camb’-idge, Mass., 6S Dunsler St., Apt. SS 
Motock, George T. M.S , M.E. (Carnegie Inst. Tech., Ohio State) Box 305, Station D, 
Cleveland 4, Ohio 

Mottley, Charles McC. Ph.D (Toronto) Lt, USNR, Bur of Ships, Code 333, Navy 
Dept , Wash., D C., S51S S. Wakefield Si , Arlington, Va. 

Mouzon, Prof. Edwin DuB., Jr. Ph.D. (Illinois) Chin., Dept, of Moth , Southern Metho¬ 
dist Univ., Dallas, Texas, 2816 E Lovers Lane, Dallas 6 
Mudgett, Prof. Bruce D. PhD (Pennsylvania) Dept, of Econ. and Stat., Univ. of 
Minn , Minneapolis 14, Minn., 1417 E. River Rd. 

Muench, Prof. Hugo Dr.P.H (Johns Hopkins) Dept, of Bioatat, Harvard School of 
Public Health, 66 Shattuck St., Boston 15, Mass. 

Mummery, Charles R, B A.So. (Toronto) Sec. Head and Roe. Engr , The Hoover Co., 
N. Canton, Ohio, 606 E. Maple St. 

Murphy, Barbara M. Librarian, Raytheon Mfg. Co,, Power Tube Div,, Foundry Ave,, 
Waltham 54, Mass. 

Murphy, Ray B. B.A (Princeton) Princeton Univ , Princeton, N J., 28 Godfrey Rd., 
Upper Montclair 

Murphy, Ray D. A B (Harvard) Vicc-Pres. and Actuary, Equitable Life Assurance 
Soc , N. Y , N. Y., 28 Godfrey Rd., Upper Montclair, N. J 
Murray, Jfanet H. M.A. (Stanford) Asst. Head, Family Eo Div , Bur. Human Nutri¬ 
tion and Home Ec , U. S Dept, of Agric , Wash., D. C , 1026 Connecticut Ave , Wash. 6 
Myers, James E. A.B (Michigan) Res Group, Moore School of Elec Engr , Univ. of 
Pa , Philadelphia, Pa , 1S12 Pine St , Philadelphia 7 



MEMBERS OF THE INSTITUTE 


539 


MysUvec, Prof. Vaclav Sc.D. (Czech Tech Umv„ Prague) Delegate of Czechoslovak 
Gov’t at the United Nations Comm on Food and Agno., Rm 608, 1775 Broadway 
N. Y. 19 

Nannl, Liils F. G. E (Tucuman) Fine Hall, Princeton Univ,, Princeton, N J., Oradu- 
ale Coll. 

Nash, Stanley W. M.A (U. of Cal., Berkeley) Grad Student, Umv. of Calif , Berkeley, 
Calif , SUSS Chanmng Way, Berkeley ^ 

Neifeld, Morris R. Ph.D. (New York) Economist, Beneficial Mgt Corp., 15 Wash 
St., Newark, N. J., 6J^ Prospect St., Maplewood 
Nekrassoffr Vladimir A. Dr Eng. (Miohailovskaya Acad of Art) Asst. Ballistician, 
Ballistic Res. Lab . Aberdeen Proving Gd., Md , 143 Weher St, Havre de Grace 
Nelson, Franklin S. AM. (Columbia) Chief, Qual Control Sec,, Picatinny Arsenal, 
Dover, N J,, S Woodland Ave , Summit 

Nenuners, Frederic E. S.M (Iowa State) Instr, Umv. of Wis , Madison, and Con¬ 
sulting Engr , Ladish Drop Forge Co., Cudahy, Wis , 3936 N. Hackeit Ave , Milwaukee 
11 

Nesbitt, Asso. Prof. Cecil J. Ph.D (Toronto) Dept of Math , Umv. of Mich., Ann 
Arbor, Mich , 1913 Frieze Ave, 

♦Neumann, Prof. John von PhD (Budapest) Dept of Math, Inst for Advanced 
Study, Princeton, N. J , Westcott Rd. 

Neurath, Paul M. LL D (Vienna) Lecturer, Coll of the City of New York, 17 Lexing¬ 
ton Ave., N Y,N Y.,B49W llSthSt,N Y 35 
Neurdenburg, Dr. M. G. M D (Leiden, Holland) Medical Officer, Head of Dept of 
Stat., Munio Medical and Health Service, Amsterdam C (Holland), Amsterdam 
Zuid 1, Holland, Frans van Tierisstraat 134- 

Newman, Doris M A (Michigan) Instr. in Stat, School of Bus, Adm, Crosby Hall, 
Buffalo 14, N Y 

♦Neyman, Prof. Jerzy Ph.D. (Warsaw) Dept of Math. Stat, Columbia Umv., N Y., 
N. Y., 954 Euclid Ave., Berkeley (on leave from Stat Lab., Umv of Calif , Berkeley, 
Calif., first sem 46-47) 

Nichols, Russell T. A.B (DcPauw) Student, Univ of Chicago, Chicago, 111, I 4 I 8 
Hyde Park Blvd,, Chicago 15 

Nicholson, George E., Jr. M A. (Carolina) Asst Math , Applied Math Group, Colum¬ 
bia Univ., N Y , N Y., 176 Park St, Montclair, N J 
Nllson, Hugo W. PhD. (Minnesota) Gheimst, U S Fish & Wildlife Service, Coll 
Park, Md , 1107 Flower Ave , Takoma Park 13 
Nlsselson, Harold B S. (CCNY) Stat,, Bur of the Census, Wash., D C,, 4^33 N. 
Fourth St , Arlington, Va 

Noel, Roland H. M S. (Mass Coll, of Pharmacy) Spec. Asst to Prod. Mgr., Bristol 
Labs , Inc., Syracuse, N. Y., 310 Milford Dr., W. 

Noether, Guttfrled, E. M.A (Illinois) Grad. Student, Columbia Univ, N. Y 27, N. 
Y , 536 W. 114 th Street, N Y 35 

Noland, Asst. Prof. E. William Ph.D (Cornell) Labor and Management Center, Yale 
Univ , 333 Cedar St., New Haven 11, Conn. 

Norden, Monroe L. SB. (Maas. Inst. Tech.) Stat, Aberdeen Proving Gd , Md., 55 
Nagle Ave , N. Y 34, N. Y 

Nordipilst, John M. M.S. (Oklahoma) Res Asst, Seismological Lab , 220 N San Rafael 
Ave , Pasadena 2, Calif , 1695 Corson St., Pasadena 4 
Norris, Nllan Ph.D, (Stanford) Dept.of Econ .Hunter CoU.,N.Y.21,N Y. 

Northam, Jack 1. M.A (Michigan State) Teaching Fellow in Math., Univ. of Mich., 
Ann Arbor, Mich , 1351 Sharon Ct., Willow Village 
Norton, Horace W. Ph D. (London) Meteorologist, US Weather But , Wash. 25, D. C., 
3118 N. First Rd , Arlington, Vo 



540 


MKMUKnS OP THE IXHTITUTK 


fNorton, Kenneth A. B.S. (Chicago) Physicist, Central Radio Projiagation Lab., National 
Bur. of Standards, Wash. D. C , Kermwre Dr., N W., Wash 7 

Oakley, Prof. Cletus 0. Ph D. (IHihoib) Haverford Coll , Haverford, Pa. 

O’Callahan, Rev. Joseph T. S.T.L. (Gregorian) Prof, of Math., Holy Cross Coll., 
Worcester, Mass. 

O'Connor, Howard J. M A. (Toronto) Tech. Asst, Union Carbide & Carbon Res. 
Labs., Inc., Development Div., Electro Metallurgical Co., 47th St., Niagara Palls, 
N. Y., 1016 Cleveland Ave. 

Odle, John W. PhD (Michigan) Co-Hcad, Math Section, US Naval Ord. Tbst Sta., 
Inyokern, Calif. 

Okun, Yetta E. B.A. (Hunter) Res Asst, Dept, of Ijabor, Wash , D. C , SISO 16th St., 
N.W., Wash. 9 (Last address) 

Olds, Edward B. Ph D. (Pittsburgh) Dir., Res. Bur. of Social Planning Council, 613 
Locust St, St. Louis 1, Mo. 

*01ds, Asso. Prof, Edwin G. Ph.D (Pittsburgh) Dept, of Mat)^ , Carnegie Inst, of 
Tech., Schenley Park, Pittsburgh 13, 2SS Gladstone Rd , Pillshurgh 17, Pa. 

Olllvler, Asso. Prof. Arthur Ph.D. (Iowa) Acting Head of Dept., Missiasippi State 
Coll, Box 604, State College, Miss. 

Olmstead, Paul S. Ph.D. (Princeton) Tech. Staff, Bell Tel Labs., N. Y , N. Y., Box 
72, Essex Falh, N J. 

Olshen, Abraham. C, Ph D. (Iowa State) Actuary & Coinptroller, West Coast Life Ins. 
Co., 60S Market St., San Francisco 6, Cnlif., 41^6 Vidal Drive, San Francisco IS 

O’Nell, Frank (Lowell Textile Inst.) Sr. Textile Tech,, Worsted Div., Pacific Mills, 
Lawrence, Mass. 

Oosterbof, WllUs M. M.A. (Michigan) Chief Stat., State Dept, of Social Welfare, 
Lansing, Mich., 811 Hackell Si , Ionia 

Orcutt, Guy H. Ph.D. (Michigan) Dept, of App. Economics, Kings Coll , Cambridge 
Univ., Cambridge, Eng. 

Ore, Prof, Oysteln Ph.D. (Oslo) Yale Univ., New Haven, Conn 

Osborne, James G. B S, (California) Chief, Sec Forest Meas., Forest Service, Wash., 
D. G. 

Ostermon, Herbert W. B. E. Wyatt Co., 1020 Vermont Ave., N.W., Washington, D. C. 

"O'Toole, Alphonsus L. Ph.D. (Michigan) Lt., Staff Com. S. Pacific Fleet P.O., San 
Francisco, Calif. (Last address) 

Gtt, Asso. Prof. Ellis R. Ph.D. (Illinois) Dept, of Math , Rutgers Univ., New Bruns¬ 
wick, N. J., 199 Sterling Dr., Orange 

Owen, F. V. Ph.D. (Wisconsin) Geneticist, USDA, 1910 S Main St., Salt Lake City, 
Utah 

Owen, Ruth L. M.A (Columbia) Lt. (j g.) Disbursing Officer, Box 1395 USNAS, 
Navy 129, FPO, San Francisco 1, Calif. 

Page, Warren H. B A. (Queens) Student, Columbia Univ,, N. Y., N. Y,, SS-^t Junc¬ 
tion Blvd. Jackson Heights 

Parke, Nathan G. Ill, A.B. (Princeton) Res. Asso. in Physics, Res. Lab, of Electron¬ 
ics, Mass. Inst Tech , Cambridge, Mass., Spencer Brook Rd., Concord 

Pascua-Martbaez, Asst. Prof. Marcelino M.D. (Madrid) Dept, of Blostatistics, .lohns 
Hopkins Univ., School of Hygiene, 616 N. Wolfe St., Baltimore 6, Md., 10 E. SSrd 
St., Baltimore 18 

Passano, Russell F. B.S. (Johns Hopkins) Metallurgical Engr., Bethlehem Steel Co., 
Bethlehem, Pa., 809 Wall St. 

Pastore, Nicholas M.S. (CCNY) Instr., Union Jr. Coll., Cranford N. J., 8844 Radcliff 
Ave., N. Y., N. Y. 

Patte, WUllam E. B.A Sc. (Toronto) Stat. En^r., Canadian Ind. Ltd., Bhawinigan 
Falls, Que., Can., SSO-lOth St., AlmavtUe, Que. 



MEMBEHS OF THE INSTITUTE 


641 


Pauli, Allan E. B.A. (Manitoba) 114 Mangurn, U of N, C., Chapel Hill, N. C. 

Paulson, Edward M.A (Columbia) Inatr,, Inst, of Stat., Univ. of N. C., Chapel Hill, 
N. C. 


Payne, Asbo. Prof. Charles K. Ph.D. (New York) Dept, of Math., New York Univ., 
100 Washington Square E., N. Y., N. Y , SS ValUy Road, Butler, N. J. 

Peach, Asso. Prof. Paul Inatr of Stat., N G, State Coll., Box 6576, College Sta., Raleigh, 


N. C. 

♦Pearson, Prof. E. S. D.So. (London) Dept, of Stat., Univ. Coll,, Gower St., London, 
W. C. 1, Eng., iOA Frognal Lane, Hampstead, London N. W, S 
Peiser, Donald E. M.A. (Columbia) 1117 New York Ave , Brooklyn, N. Y 
Perlo, Victor A M. (Columbia) Economist, U. S Treasury Dept., Wash. D. C., 4^11 
Brandywine Si., N.W., Wash, 18 
Perlsteln, Mae B.A. (Hunter) 177S Vyae Ave., Bronx 60, N Y. 

Perrott, Ivan B. M-A. (Oxford) 17 Widney Manor Road, Solihull, Warwickshire, Eng, 
Perryman, Major James H. M.A. (Wesleyan) Mimstry of works, Lambeth Bridge 
House, London S.B. 1, Gilbert Hotel, Ilfracomhe, Devon, Eng 
Peterson, Andrew 1. MS (Columbia) Qual. Control Mgr , Radio Corp. of America, 
Harrison, N.J., 67 Howell Rd , Mountain Lakes, N.J. 

Petrie, George W., Ill M.S. (Carnegie Inst) Sec Engr , Bethlehem Steel Co., Bethle¬ 
hem, Pa., I 46 S Lehigh Parkway, Allentown 
Pierce, Prof. Joseph A. Ph.D. (Michigan) Atlanta Univ., Atlanta, Ga. 

Piper, Robert I. A.B. (Montana) Plant Staff Asst., S. Calif Tel. Co., 740 So Olive St., 
Los Angeles 65, Calif., iHS San Vicente Blvd, Santa Monica 
Plxley, Asso. Prof. Henry H. Ph D (Chicago) Asst. Dean and Asso. Prof, of Math , 
Wayne Univ., Detroit 1, Mich., 601S4 BnardiJ^ Ave , Detroit 61 „ , 

PHa, Prof. Alfonso P. de Toledo Ph.D. (Sao Paulo) Escola Politechnica, San Paulo, 

Brazil, 1166 Rua MinUlro Godoy , n u 

Pollard, A. H. The Mutual Life & Citizens' Assur. Co. Ltd , Martin Place and Castler- 


reath St., Sydney, N.S.W., Australia 

Pollard, Prof. Harry S. Ph.D. (Wisconsin) Dept of Math., Miami Umv., Oxford, 

Ohio, 6SO Patterson Ave. „ , ^ ^ 

Pope, Otis Ph.D. (Iowa State) Sr. Biometrician, USDA, Tech. Collaboration Brano , 

PostoZ’^nitl^L!^ S 1l^chigl)^S/2/c, 928-18-90 Ward 1606, U 8, Naval Hospital, 

PrelmeS othHa^ A Ph.D. (Columbia) C.P.A E. 42nd St. N. Y. 17, N. Y. 
Preston, Bernard CPA,, 103 Park Ave,, N. Y , N Y £01 W ' 5 ^ 

Price, Prof. G. Baley PhD (Harvard) Dept, of Math., Umv. o£ Kane . 205 IranH 

Strong Hall, Lawrence, Kans., 1B£1 Rhode Island Si. Chambers- 

Priestley, Alice E. M A. (New York) Instr.. Stat. and Math., Wilson Coll., Chambers 

Qu.no3;.'’M.«rl» H. B.A (C.„b,ld^) Sij,. MtoW » 

Harponden, Hsrta, Bng., S Chmltad BI-. Bu ° j.,, ,30, Roohes- 

Rafferty, J. Allan B.B. (Harvard) Medical Student, Pfc. AblP (Aus; uo , 

ter Med. W K Kellogg Foundation, Battle Creek, 

Rakesky, Sophie M.S. (Michigan) Stat., W. JA. Aciiogg rouu a 

RandMI,“Vob!Tf j“ BH''(Yale) Grad. Student. Columbia Univ., N. Y., N. Y., £58S 

Agency, Wash, D. C,, SOS Shirtey St - ^ ^ Calif Berkeley, 

Rappaport, Gladys B A (Hunter) Ass't. Dept of Math-, urn 

Calif , 2401 Durant Ave , Berkeley 4 



542 


MEMBERS OE THE INSTITUTE 


lUtkowm, Elsie B.A, (Hunter) 3280 Spencer Drive, Bronx, N. Y. 

Raybould, Ethel H. Univ. of Queensland, Brisbane, Australia 

tReed, Prof. Lowell J. Ph.D. (Pennsylvania) Dept, of Biostatistios, Vice Pres. Johns 
Hopkins Univ., School of Hygiene and Public Health, 616 N. Wolfe St., Baltimore 6, 
Md., S^OO Duvall Ave., BaUinwre 18 

Rees, Prof. Carl J. Ph.D. (Pennsylvania) Dept, of Math., Univ. of Del., Newark, 
Del., iSO E. Main Street 

Regan, Prof. Francis Ph.D. (Michigan) St. Louis Univ., St. Louis, Mo., W. Terre 
Haute, Ind. 

Reid, David B. W. B.A. (McGill) 148 Hopetiale Ave., Toronto, Ont,, Can, 

Reiner, Mae B.A. (Hunter) 170 Second Ave., N Y., N. Y. 

Reitz, Asso. Prof. William Ph.D. (Wisconsin) Dept, of Educ., Wayne Univ., 6272 
Second Blvd., Detroit 2, Mich., 980 Atkinson 
Reno, Franklin V. A.M. (Virginia) Math., Ballistic Res. Labs , Aberdeen Proving Gd., 
Md., 44118 S Bannock St., Englewood, Colo. 

Reynolds, John H. M A. (Univ. of the South) Tech. Control Stat., Celanese Corp. 
of Amer., Rome, Ga , 4^8 B. Third St. 

Reynolds, Wm. A. M.A. (California) Res Assoe., Nat’]. Broadcasting Co., 30 Rocke¬ 
feller Plaza, N. Y. 20, N. Y., iO E. 100 Si., N. Y. 89 
Rhodes, Joseph S. 4787 Homer Ave., Suitland, Md 

Rice, Asso. Prof. J. Nelson Ph.D. (Catholic Univ. of A.) Catholic Univ. of Atner., 
Washington, D. C., SS88 ISlh St,, N.E., Wash. 17 
Rice, William B. A.B. (Davidson) Consulting Bus. Stat., Rm. 612, 117 E. Calif. St., 
Pasadena, Calif., 908 S. Baldwin Ave., Temple City 
Richardson, Prof. C. H. Ph.D. (Michigan) Buckncll Univ., Lowisburg, Pa., 401 S. 
Sixth St. 

♦Rider, Prof. Paul R. Ph.D. (Yale) Washington Univ., St Louis 6, Mo. 

Rlordan, John B.S. (Yale) Member Tech. Staff, Bell Tol. Ijibs., Inc., 463 West St., 
N. Y. 14, N. Y., 71 Flower Ave., Hastings on Hudson 
Rlpandelll, John S. B.A. (Columbia) Miles M. Dawson & Son, Consulting Actuaries, 
600 6th Ave., N Y., N. Y. 

Robbins, Asso. Prof. Herbert E. PhD. (Harvard) Inst of Stat., Univ., of N. C , Chapel 
Hili, N. C 

Roberts, Jean M.S. (Minnesota) Child Welfare Res. Analyst, Div. of Social Welfare, 
St Paul, Minn , 39 So. Avon. Apt. SB, St Paul S 
Robinson, Selby L. Ph.D (Iowa State) Inatr., Coll, of the City of N. Y., 139th St. and 
Convent Aves., N. Y , N. Y., 98 Hamilton Ave., Yonkers 
Robinson, William S. Ph.D. (Columbia) Lecturer, Columbia Univ., N. Y., N. Y., 488 
W. 180th St. 

Rock, Sibyl M. BB (California) Res. Asso., Consolidated Eng Corp., 020 N. Lake Ave., 
Pasadena 4, Calif., 1810 N. Sinalda, Pasadena 7 
Rodal, Juan A. Ph.D. (Buenos Aires) Dr. en Cienoias Eoonomicas, Univ. of Buenos 
Aires, Buenos Aires, Argentina, Aviles 3755 

Rodrigues, Milton da Silva Dr. (Brazil) Prof, of Stat, Univ. of Sao Paulo, Brazil, 
Caixa Postal 10S~B 

Rolfe, Mrs. Kathryn Benson M.S. (U. of Washington) Asso. in Math., Univ. of Calif., 
Coll, of Agric., Davis, Calif., Clarksburg 

Romlg, Harry G. Ph.D (Columbia) Member of Tech. Staff, Bell. Tel Labs., Inc., 
463 West St, N Y., N. Y. 

♦Roes, Charles F. Ph.D. (Rice) Pres , Econometrio Inst., Inc., 600 Fifth Ave., N. Y., 
N. Y. 

Roaander, A. C. Ph.D. (Chicago) Bur. of International Review, Wash. D. C., 7900 
Lynnbrook Dr., Bethesda 14, Md 



MEMBEHS OP THE INSTITUTE 


543 


Rosen, Daniel I. A.B, (Columbia) Medical Stat, Office of Surgeon Gen War Dent 
The Pentagon, Wash., 25, D. G., 3415 SSth St., N.W . Rm. m-1, Wash. iS 

Rosenblatt, Alfred Calls Atahualpa 1B2, Miraflores, Peru 

Rosenblatt, David B.S. (OCNY) Asst Stat., 12 Chesapeake St., S W., Wash D C 

Roshal, Sol M. B.S. (Chicago) Psych Res. Project (C. C,), Lincoln Army Air Field 
Lincoln, Nebr. ’ 

Ross, Frank A. Ph.D (Columbia) Thelford, Vermont 

Rothbard, Murray S70 Central Park West, JV. Y., N. Y 

Rubin, Ernest M.A. (Columbia) Stat., US Dept, of Justice, Immigration and Natural¬ 
ization Service, 1500 Chestnut St., Philadelphia 2, Pa., 7I0A Emerson Ave Uvver 
Darby 

Rubin, Herman S.M (Gboago) Res. Asst., Cowles Comm, for Res. in Econ., Umv. 
of Chicago, Chicago, Ill., 7148 S E End Ave 

Rudnickl, Alei M.A. (Columbia) Stat., Econometric Inst., 500 5th Ave., N Y. N. Y., 
KPS Lorimer St., Brooklyn SS 

Huger, Asbo. Prof, Henry A, Ph.D. (Columbia) Retired, Columbia Umv, N. Y., N. 
Y , BS6 N. Main St., Wellington, Ohio 

Rule, Wayne R. M.S. (Iowa) Metropohtan Life Ins Co , 1 Madison Ave,, N. Y. 10 

N Y. 

Rulon, Prof. Phillip J. PhD. (Minnesota) Dept, of Educ., Harvard Univ., Cambridge 
38, Maas , 10 Craigie St. 

Rupp, William B. Supv , Qual. Control, RCA Victor Div., Radio Corp. of America, Har¬ 
rison, N. J., SO Dodd St., E. Orange 

Ryan, Asso. Prof. Thomas A. Ph.D. (Cornell) Dept of Psychology, Morrill Hall, 
Cornell Umv., Ithaca, N. Y 

Sachs, Rose B.A. (Gouoher) Stat., 14 S 8 R St, N.W , Wash. 9, D 0. 

Soldel, Frank M.A. (Michigan State) Grad. Student, Columbia Univ , N. Y , N. Y., 
40 W. 88lhSt., N. Y. Si 

Salerno, John B.A. (Brooklyn) Math., U. S. Coast and Geodetic Survey, 541 Wash. 
St., N. Y., N, Y., BSO Lincoln Ave., Brooklyn 8 

Salklnd, William MBA (Chicago) Stat., U. S. D. A., Wash , D. C., Sli9 Kay St , N.W., 


Fosli. 7 

Sandellus, D. Martin Fil. kand. (Stockholm) Baltzar von Platensg. 5, VI, Stockholm, 
Sweden 

Sandomlre, Marlon M. A.B (Hunter) Stat., Navy Dept, Bur. of Ships, Code 333, 
Wash. 26, D. C., 14 S 0 Crittenden Si., N W., Wash 11 
Sard, Asst. Prof. Arthur Ph.D (Harvard) Dept, of Math., Queens Coll., Flushing. 
N. Y,, 146-19 Beach Ave 

Sasuly, Mai M.S. (Chicago) JK106,1029 Vermont Ave., N.W., Wash. 6, D C. 
Satterthwalto, Franklin E. Ph.D (Iowa) Aetna Life Ins. Co., Harrford, Conn., S4 


Carol Dr., Manchester 

Saunders, Robert J. B.S. (Mass. Inst, of Tech ) Mohawk Carpet Mills, Amsterdam, 

N.Y.,11 SlewartSt. , ^ , 

Savage, Leonard J. Ph.D, (Michigan) Special Bockefeller Fellow, Rockefeller Founda¬ 
tion, 49 W. 49th St , N. Y., N. Y. ...... u * 

Schaeffer, Esther B. A.M. (Michigan) Tech. Asst., Stat. Lab., Umv of Mich , Ann 

Arbor, Mich., 1S7 N. Slate St. ...... n , r 

‘Scheffe, Asso. Prof. Henry PhD. (Wisconsin) Bngipeenng Dept, Umv. of Calif. 

atL A., Lob AngeloB 24, Cald., ISIS Acton St., Berkeley S t. n 

Schell, Emil D. M A. (Western Reserve) Stat., Bur. of Labor Stat., Wash. 26, D. C., 
S 440 JV. IS Rd., Arhngton, Va. 

Scherl, Bernice M.A. (Columbia) Stat.. Sohenley Res. Inst., Ino., 350 5th Ave., N. Y., 
N y., m E S06 SI., N y. B8 



544 


MEMUKUH OP THE IXSTITTITE 


ScWetroma, WllUam B.S S. (CCN Y) Rob. AbbI , aiS E imh El., .V Y , N. Y. 
ScbllllnK, Asst. Prof. Walter M.D. (Harvard) Asst. Ghniral Prof of Mcdicmp, Stan¬ 
ford Medical Schwl, Stanford Univ. Hospital, San Francisco 15, (^hf., 8008 IfosA- 
tnglon Si., San Franciaco 9 

Sclilorek, Mary A. A.B. (Adelphi) St. Stat. and Sr. Supv., Nat. Broadcasting Co., 30 
Rockefeller Plaza, N, Y 20, N. Y., IBl Northern Parkway, Hempatead, L. L 
Schmalz, William H. B.S.A. (Toronto) Tech. Supt., Merchants Rubber Factory, 
Dominion Rubber Co. Ltd., Kitchener, Ont., Can., SI lireuhaupl Si. 

Schneberger, Richard H. Dir. of Training, Edison Gonoml Elec. Appl. Co,, S600 W. 

Taylor St., Chicago, Ill., 606 Gary .dee., WheaUm 
Schoenbauffl, Prof. Bmll Ph D. (Prague) Dept, of Applied Math., School of Natural 
Science, Univ. Charles IV, Prague, Czechoslovakia, Praha XIX, I'erronaka 88 
Scbrock, Edward M. B.S. (Pittsburgh) Chief, Stat. Analysis Branch, Lab. Service 
Div , Devel. A Proof Services, Ord. Rea. A Devel. Center, Aberdeen Proving Gd., 
Md., 83 Liberty St., Aberdeen 

Schug, Howard L. B.S (lAfayetto) Regional Stat, Fed. Public Housing Authority, 
San Franciaco, tWif-, 830 Forest A»b., Palo Alto 
Schultz, Asso. Prof. Andrew S., Jr. Ph.D. (Cornell) Cornell Univ., Ithaca, N. Y., 
830 Renmck Dr. 

Schumacker, Prof. Francis X. B.S. (Michigan) School of Forestry, Duke Univ., Durham, 
N. 0. 

Schwartz, David H. B.S, (CCNY) Director of Quality Assurance, Quartermaster Corps 
Inspection Service, 111 E. 16 3t., N. Y., N. Y., 8S-B7118 St., Kew Gardena, L. I. 
Schweitzer, Morton D, Ph.D. (Columbia) Stat. and econ. consultant, 60 West lOtb 
St., N. Y. 11, N. Y 

Secrlst, Horace A. B.S. (Plarvard) Dir. of Ros., Kendall Mills, Div. of the Kendall 
Co,, Walpole, Mass. 

Seeley, Sherwood B. Ch.E. (Now York) Dir, of Rea., Joseph Dixon Crucible Co., 
Jersey City 3, N. J., 70 Park Lane, Qrymea Hill, Staten Island 1 
Segal, Irving E. Ph.D. (Yale) Asst., Inst, for Advanced Study, Princoton, N. J. 

Sells, Saul B, Ph.D. (Columbia) Asst, to Pres., A. B. Prank Co., San Antonio 6, Texas, 
306 Abiso Ave., Alamo Hla., San Antonio S 

Seth, Gohlnd R. M.A. (Delhi) Student, 1346 John Jay Hall, Columbia Univ., N. Y, 
27, N. Y., 8 Imperial Ave., Old Viceregal Eatale, Delhi, India 
Shannon, Claude E. Ph.D. (Mass. Inst of Tech.) Member, Tech. Staff, Bell Tel. Labs., 
463 West St., N, Y., N, Y. 

Shaw, Byron T. Ph.D. (Ohio State) Pnn. Agronomist, Plant Industry Station, Belts- 
ville, Md. 

Shaw, Lawrence W. M.A. (Pennsylvania) Stat,, Tuberculosis Control, U. S. Pub. 

Health Service, Bethesda, Md., 8907 Oneida Lane 
Shelton, William A. A.M. (Chicago) Sr. Transportation Economist, Maritime Comm., 
Econ. and Stat. Res,, Wash. D. C., 3811 Tennyson St., N.W. 

Shepard, Ronald W. Ph.D. (U. of Calif., Berkeley) Lecturer, Umv, of Calif,, Berkeley, 
Calif, 1885 Berkeley Way 

Sheppard, David B.S. (Yale) Stat. Hq , AAF, Wash., D. C., 8781 Terrace Rd., S.E., 
Wash. SO 

Sherman, Jack Ph.D (Calif. Inst, of Tech.) Res. Chemist, The Texas Co., Port Ar¬ 
thur, Texas, Ro* 758 

♦Shewhart, Walter A. Ph.D. (California) Res. Engr Boll Tol. Lab., Murray Hill, N. J., 
158 Lake Dr., Mountain Lakes 

Shulman, Harry M.A. (Columbia) Economist, OPA, Wash., D. C., 4^3 Randolph St,, 
N.W, Wash. 11 

Shyhekay, Derso S. Ph.D, (Budapest) Pres., Ind Res. Council, Box 324, Minneapolis 
1, Minn. 



MEMBERS OP THE INSTITUTE 


545 


Slegeltuch, Norman B.S (CCNY) 2201 Caton Ave, Brooklyn 26, N. Y 
Sllber, Jack B.S. (Chic^o) Instr. in Math., Soosevelt Coll , 231 S Wells St., Chicago 
4, Ill., 4308 N. Spnngjield Ave., Chicago SB ^ 

Sllvelra, Prof. Fernando R. da Dr. (Brazil) Prof, of Educ Stat., Institute de Edueacao 
RuaManz e Barroa, 273, Rio de Janeiro, Brazil, Rua Francisca Sales, 90 {Jacarepagua) 
Simmons, Walt R. M.A. (Kansas) Sr. Labor Boon., War Manpower Comm Wash. 

D. C. Lt. USNRR, Bur of Ships, Navy Dept., 4SS4- N. Snd Rd., Arlington^ Va 
Simmons, Willard R. M A. (Duke) Head of Stat Sec , Food and Automotive Ration¬ 
ing Div., OPA, Wash., D. C., B1S0 N. Wash. Blvd , ArUngton, Ya. 

Simms, Clifford R. M.S. (Michigan) Mgr., Cleveland Office, B. E. Wyatt Co , 706 
Leader Bldg , Cleveland, Ohio, ISIB Arlington Ave. 

Simon. George B. Ed.M. (Harvard) Chief, Analysis and Res. Umt, Psychological Sec. 
Office of Surgeon. Hq AAFTC, Barksdale Field, La., 7Sff Blue Hill Ave., Dorchester, 
Mass. 

Simon, Leon G. Pension Consultant, 393 7th Ave., N Y., N. Y 

*Slmon, Leslie E. B.S (USMA, West Point) Col O D , Dir Ballistic Res. Lab., Army 
Ord. Dept , Aberdeen Proving Gd , Md. 

Simpson, Tracy W. E.E. (Rlinois Inst, of Tech.) Sales Promotion Mgr., Marchant 
Calculating Machine Co., 1476 Powell St, Oakland 8, Calif., S90$ Forest Ave Berke¬ 
ley S 

Simpson, William B. M.A (Columbia) Dept of Ecom , Univ. of Chicago, Chicago, HI. 
Skalak, Blanche A.M, (Columbia) Stat., Bur. of the Census, Wash. 25, D. C., 464 s 
Hillside Rd., S.E., Wash. 19 

Smart, Prof. L. Edwin Ph.D. (Ohio State) Ohio State Univ , Columbus 10, Ohio, 4 IO 
King Aca., Columbus 1 

Smart, Oliver M, P.H.B. (Yale) Engr , Naval Air Exp. Sta., Philadelphia Navy Yard, 
Philadelphia, Pa., Hotel Parker, ISth and Spruce Sts, 

Smith, Prof. Edward S. Ph.D (Virginia) Dept, of Math., Univ of Cincinnati, Cin¬ 
cinnati 21, Ohio, 1 Hedgerow Lane, Cincinnati SO 
Smith, Asst. Prof. Frank E. Ph.D. (Catholic) Brooklyn Coll., Brooklyn, N Y. 

Smith, Prof. James G. Ph.D (Princeton) Princeton Univ., Princeton, N J., 80 Jlfurroj/ 
PL 

Smith, Joan T. B S. (Minnesota) Accountant, Wood Conversion Co , 19th FI W., 
First Natl. Bank Bldg., St. Paul, Minn , 673 E. Nebraska Ave., St. Paul 6 
Smith, John H. Ph.D. (Chicago) Acting Chief Stat., Bur. of Labor Stat., Wash. 26 
D. C., 6110 14th St., N.W., Wash. 11 

Smith, Robert T. HI Stat. Analyst, Interstate Commerce Comm. Wash , D. C., 9S7 
leih St., S. Arlington, Va. 

♦Snedecor, Prof. George W. M A (Michigan) Dir. of Stat. Lab., Iowa State Coll., 
Ames, Iowa SSI Forest Olen 

Sobczyk, Andrew Ph.D. (Princeton) 63 Pine Ridge Rd., Arlington 74, Mass. 

Sobel, Milton M.A. (Columbia) Asst, in Math. Stat., Columbia Univ., N. Y , N. Y., 
38 Elliot Place, Bronx BS , 

Solomon, Herbert M.A. (Columbia) Instr., Coll, of the City of N Y., N. Y., N Y , 
801 E Tremont Ave,, Bronx 60 

Sousa, Alvaro P« do B E. (Liverpool) Vice-Governor, Banco de Portugal, Monserrate, 
Rua Infante de Sagres, Estonl, Portugal 

South, Asso. Prof. Dudley E. Ph.D. (Michigan) Dept, of Math., Umv. of Ky., Lexing¬ 
ton, Ky. 

Spaney, Emma M.S. (CCNY) Stat., Natl. League of Nursing Educ., Coram. on Measure¬ 
ment and Educational Guidance, 1790 Broadway, N. Y., N. Y., 9144 108th St., Rich¬ 
mond Hill 18 « .-I „ XT n 

Spaulding, Aaa T. M.A. (Michigan) Actuary and Asst. Secy.-Comptroller, N. C. 

Mutual Life Ina. Co., Durham, N. C., 1608 Ldncgln St. 



64G 


MKMBKHS OF TUB INSTITUTE 


Speaker, Asso. Prof. Guy G. A.M. (Indiana,) Micliigan State Coll., E. I^nnalng, Mich., 
Box 61 

Spiegelman, Mortimer MBA. (Harvard) Supv. of Math. Res. Stat. Bur., Metropolitan 
Life Ins. Co., 1 Madison Avo. N. Y. 10, N. Y,, &0 Riverside Dr., N. Y. S4- 
Spoerl, Charles B A. (Harvard) Asat. Trcaa., Aetna Life Ina. Go., Hartford, Conn, 
Sprengel, Herbert J. M S. (Illinois) Quai. Control Engr., SOS N. Lombard Ave., Oak 
Park, III, 

Springer, Melvin D, M.S. (Illiuoia) Asst., Univ. of Ill., Urlmna, Ill., 1001 W. IlhruHs Si. 
Springer, William M. Ch.Bng. (Columbia) Vice-Pros, in Charge of Res., Bristol Myers 
Co., 225 long Ave,, Hillside, N. J., R.F.D. Ill, Basking Ridge 
Stauher, B. Ralph B.S. (State Coll, of Wash.) Chief, Relocation Planning Div., War 
Relocation Authority, Wash., D, O.,8701 Bexhill Dr., Rock Creek Hills, Kensington, 
Md. 

Steele, Floyd G. M.S (Calif. Inst, of Tech.) Stat. Analyst, Douglas Aircraft, 1818S 
Roosevell Highway, Pacific Palisades, Calif. 

Steen, Jerome R. B.S. (Wisconsin) Mgr., Qual. Control Engineering, Sylvania Elec. 
Prods., Inc., Emporium, Pa. 

Stehn, John R. Ph.D. (Wisconsin) Physicist, Winchester Repeating Arms Co., Now 
Haven 4, Conn., S7 Norlhsule Rd., Hamden H 
Stein, Arthur M.A. (Columbia) Stat., Ballistic Res Ijab., Aberdeen Proving Gd., 
Md., S Liberty Si., Aberdeen 

Stein, Charles M. B.S. (Chicago) Student, Columbia Univ., N. Y., 109-69 Colfax St., 
Si Albans, 11 

Stein, Irving B S. (Maas. Inst. Test) Qual. Control Engr., Polaroid, Ino., 10 Forest 
Park Ave., Adams, Mass, 

Steinberg, Gunther T. M.A. (Chicago) Math. Economist, Western Elec. Co., 196 Broad¬ 
way, N Y. 7, N. Y., 4f Hamilton St; E. Orange, H, J, 

Steinberg, Joseph B.S. (CONY) Stat., Bur. of the Census, Wash. 26, D. C., S 04 I N. 
Capitol St., Washington 11 

StelnhauB, Henry W. Ph.D. (Gottingen) Res. Asst., Equitable Life Ass. Soo., 303 7th 
Ave,, N. Y., N. Y., Elm Ridge Farm, Searsdale 
*Stephan, Prof. Frederick F. M.A. (Chicago) Dept, of Sociology and Stat., Cornell 
Univ., MoGraw Hall, Ithaca, N. Y., 101 Eddy SI. 

Stephens, William H. M.Sc. (Queen’s) British Commonwealth Soiontilic Office, 1786 
Massachusetts Ave., Wash , D. G., 3807 Mililary Rd., N.W. 

Sterglon, Andrew P. M.S. (Moss. Inst. Tech.) Stat. and Qual. Control Engr., Corning 
Glass Works, Corning, N, Y., IS Bacon Si., Wellsboro, Pa. 

Stemhell, Arthur I. B.A. (New York) Procedure Analyst, Metropolitan Life Ins. 

Co,, 1 Madison Ave., N. Y. 10, N. Y., S9-3S 69lh St., Woodside, L. I. 

Stevens, Milton S. M.A (Columbia) Dir. of Special Projects, Time, Ino., Time and 
Life Bldg,, Rockefeller Center, N Y., N. Y., 708 Waesche Ave , Brooklyn 39 
Stevenson, Prof. Guy Ph.D. (Illinois) Head, Math. Dept., Univ. of Louisville, Louis¬ 
ville, Ky. 

Stewart, Oscar F. 17B Pinehwst Ave., N. Y. SI, N. Y. 

Stlbltz, George R. Ph.D. (Cornell) Consultant in App Math., 393 S. Prospect St., 
Burlington, Vt. 

Stlgler, Prof. George J. Ph.D (Chicago) Dept, of Boon., Brown Univ., Providence, 
R. I, 

Stock, J. Stevens M.A, (American) Lt. USNR, Hdq. Stat. Soo. Div. of Shore Est. 

and Civilian Personnel, Navy Dept., Wash., D, C , 8608 Oarfield St., Belhesda, Md. 
Stone, Goldie F. A.M. (New York) 678 Dawson St., Bronx, N. Y. 

Stone, John R. N. M A. (Cambridge) Dir., Dept of Applied Econ., King’s Coll,, Cam¬ 
bridge Umv , Cambridge, Eng. 

Stott, Alexander L. A.B. (Harvard) Staff Ass’t, Earmngs Div., Treasury Dept., Am. 
Tel. and Tel. Co., 196 Broadway, N. Y., N. Y., 18 Unden St, Great Neck, L. J. 



MEMBERS OP THE INSTITUTE 


547 


Strieby, J. Glenn B.S. (Iowa Wesleyan) Tech. Dept., Bamberly-CIark Corp , Neenah, 
Wis., 616 E. Circle St., Appleton 

Studley, Duane M. A.A. (Colorado) Clerk, War Dept,, 16th A. F., Peterson Field, 
Colo., 1311 Cheyenne Blvd., Colorado Springs 
Sturtevant, John V. A.B. (Oberlin) Development Metallurgist, Carnegie-Ill. Steel 
Corp., Pittsburgh, Pa , SIS McClellan Dr., B. D 6, Pittsburgh 27 
Sullivan, John W. So.D. (Maas. Inst. Tech.) Metallurgist, American Iron and Steel 
Inst., 360 Fifth Ave , N. Y. 1, N Y. 

Swanson, A. G. Ph.D. (Michigan) Asst Ohm., Dept, of Math , General Motors Inst., 
Fling, Mich. 

Szatrowskl, Zenon PhD. (Northwestern) Instr., Eton. Dept, Northwestern Univ., 
Evanston, Ill 

Taylor, Thomas S. Ph D. (Yale) Dir of Res , U. S. Testing Co., 1416 Park Ave , Ho¬ 
boken, N. J , 46 Grover Lane, Caldwell 

Topping, Benjamin J. Ph D (Ohio State) Stat, Bur. of the Census, Wash. 26, D. C., 
6602 Early St , N.E., Wash ,19 

Thom, Herbert C. S. M S. (George Washington) Sr. Meteorologist, U. S Weather 
Bur., Iowa State Coll , Ames, Iowa, 227 Sheldon 
Thompson, Juanita Qual. Control Inspector, ji.117 W. 26th St , Chicago 23, III 
Thompson, Louis PhD. (Clark) Naval Ord. Test Station, Dir. Res. and Dev , Inyo- 
kern, Calif. 

Thompson, Prof. Sidney L. M S. (Tulane) Ala. Polytechnic Inst , Auburn, Ala 
Thompson, Walter H. MS (Iowa) Res Dept. Statistician, Ted. Bates Inc., 630 Fifth 
Ave , N. Y , N. Y , 4lh Ave., E. Orange, N J. 

♦Thompson, William R. PhD. (Yale) Sr. Biochemist, N. Y State Dept, of Health, 
Div of Labs, and Res , New Scotland Ave., Albany, N. Y., 1 Darroch Rd., Delmar 
Thomson, George W. B.S.E. (Michigan) Res Chemist, Ethyl Corp., Chem Res Lab., 
1600 W. 8 Mile Rd , Detroit 20, Mich 

Thomson, Prof. Godfrey H. Ph D (Strausburg) Head of Eduo. Dept., Univ. of Edin¬ 
burgh, Scotland, Moray House, Edinburgh 8 
Thurstone, Prof. Louis L. Ph.D. (Chicago) Univ of Chicago, Chicago 37, HI. 
Tomlinson, John R. AB (Pennsylvania State) Ord Engineer, Aberdeen Proving 
Gd , Md , 32 Liberty St , Aberdeen 

Tomlinson, Malcolm C. W. B S (Pennsylvania) Ord. Engr., Res and Dev. Service, 
Office of Chief of Ord , ASF, Pentagon Bldg., Wash., D. C., 3820 Southern Ave., S.E. 
Tompkins, Haldan M. Head, Works Control Lab , National Carbon Co , Fostoria, Ohio, 
P.O. Drawer 191 

Toops, Prof. Herbert A. PhD. (Columbia) Ohio State Umv., Columbus, Ohio, 14^0 
Cambridge Blvd 

Toralballa, L. V. Ph.D (Michigan) Fordham Univ., N. Y., N. Y., 26 Woodbridge, 
Highland Park, N. J 

Torrey, Prof. Marian M. PhD. (Cornell) Goucher Coll., Baltimore, Md., Newfane, 


yt. 

Torrey, Mary N. M.A. (Columbia) Member Tech. Staff, Bell Tel. Labs., 463 West 
St., N. Y 14, N. Y., 69 W. 10th St,N.Y.ll . ^ ^ 

Tosti, Carlo R. M A (Stanford) Capt., AC, Adm. Engr., Techmcal Control Officer, 
Wright Field, Dayton, Ohio, 601 Forest Ave\ Dayton 6 „ -a a 

Toulouse, Julian H, Ph.D (Iowa State) Chief Engr., Quality and Specifications Sec., 
Owens-111 Glass Co , Toledo. Ohio, P.O. Box 1036-1036, Toledo 1 ^ 

Treanor, Glen B A. (Minnesota) Prin. Tax Econ., Bur. of Internal Rev., Rm. 2232, 

TreloM?Assm Prof. Alan E. Ph.D. (Minnesota) Biostotistics Dept., Schwl of Pub. 

Health, Umv of Minn , Minneapolis 14. Minn., 2466 Bev^lyRd., 5t. Paul 4 
Trowbridge, Frederick Qual Control Engr., Sentinel Radio Corp., 2020 Ridge Ave., 
Evanston, Ill., 6766 N MapUwood Ave., Chicago 



548 


MEMBBHvS OP THE INSTITUTE 


Tniksa, Ladislar Pii.D, (Charles Univ.) Dir., Lecturer, in Math. Stat, Charles Univ., 
Praha XI-1800, Czechoslovakia 

Tsao, Fel PhD, (Minnesota) National Central Univ., Chungking, China 

Tucker, Prof. Albert W. Ph D. (Princeton) Dept, of Math., Princeton Univ., Fine 
Hall, Princeton, N. J., /{. t 

Tucker, Ledyard R. B.S. (Colorado) 50i Maph Si., Princeton, N. J. 

t*Tukey, Asst. Prof. John W, Ph.D. (Princeton) Dept, of Math , Princeton Univ,, Pino 
Hall, Princeton, N. J. 

Tuttle, Charles R. M. Sc.D. (Nancy) Ord. Kngr., War Dept., Office of Ord., Wash., 
D. C., 7ge Slat Si., S.E., Wash. 19 

Tweedy, Marjorie A. L, B.S. (Ohio State) Economist, OPA, Wash., D. C., Pox B8S6, 
Betheada, Md, 

Tyler, Asso. Prof. George W. M.A. (Duke) Va. Polytech Inst., Blacksburg, Va. 

mtinan, Joseph L. B A. (Buffalo) Grad, Student, Stanford Univ., Calif., IS Brunswick 
Blvd., Buffalo, N. Y. 

Updike, Arthur T. Mgr., Qual, Control Dept., US Naval Ordnance Plant, Indianapolis 0 
Ind. 

Upholt, William M. PhD. (California) Entomologist, Exec. Asst., Tech. Devel. Div., 
Communicable Disease Center, Box 547, Savannah, Ga., 1S09 E. BSlh St. 

Urle, Frank D. A.B, (Michigan) Supt. of Inspection, Elgin National Watch Co., Elgin, 
Ill. 

Vajda, Stefan Ph.D, (Vienna) Stat., Admiralty, Whitehall, London, Eng., 54 Chapel 
Way, Epsom, Surrey 

Vandlvere, Edgar F., Jr. M.A. (Duke) Radio Engr., Fed. Communications Comm., 
Wash., D, C., S640 S. Troy Si., Arlington, Va. 

Van Voorhls, Asso. Prof. Walter R. Ph.D. (Pennsylvania) Penn Coll., Cleveland, Ohio, 
1SS4 Compton Rd., Cleveland Heights 

Van Winkle, Fkof. Edward H. M.B.A, (Harvard) Dept, of Bus. Stat., Rensselaer Poly- 
teoh. Inst., Troy, N. Y., 1 Seymour Cowl 

Vatnsdal, Asso. Prof. John R. Ph.D. (Michigan) Dept, of Math., State Coll, of Wash,, 
Pullman, Wash., 1916 B St. 

Vezeau, Asst Prof. Waldo A. M.8. (St. Louis) St. Louis Univ., St. Louis 3, Mo,, S96S 
Wyoming, St, Louts 16 

Vickery, Asso. Prof. Charles W. Ph.D. (Texas) P.O. Box dSi, Dayton 1, Ohio 

Votaw, David F,, Jr. M.A. (Princeton) Rea. Asso., Princeton Univ., Princeton, N. J , 
SO Mercer St. 

Wadley, Francis M. Ph.D (Minnesota) Stat. Consultant, U. S. Dept, of Agric., Wash., 
D. C , SSIS N. Albemarle, Arlington, Va 

Wadman, Alton J. B.S. (Mass Inst. Tech.) Chief, Burst Pattern Analysis Sec., VI 
Fuse Div , Naval Ord. Lab., Wash , D. C., 87*0 Coleanlle Rd , Silver Spring, Md 

**Wagner, Prof. Charles C. Ph.D. (Michigan) As6t. Dean, School of L A., Pa. State 
Coll., State College, Pa., 13S Sparks Bldg. 

Waite, Prof. Warren C. Ph.D, (Minnesota) Univ. of Minnesota, Div. of Agrio. Boon., 
Univ. Farm, St. Paul 8, Minn. 

Waksherg, Joseph B.S. (C. C. N. Y.) Econ. Analyst, Bur. of the Census, 14 SS Saratoga 
Ave., N.E., Wash. 16, D. C. 

♦Wald, Prof. Abraham Ph.D. (Vienna) Dept, of Math. Stat., Columbia Univ., N. Y., 
N. Y., $41 w. loath St., N. Y. SS 

Walker, Prof. Helen M. Ph.D. (Columbia) Dept, of Educ.. Teachers Coll., Columbia 
Univ., N, Y. 27, N. Y., 108 Momingside Dr. 

‘Wallis, Prof. W. Allen B.A. (Minnesota) Univ. of Chicago, Chicago 37, III 

Walsh, John E. M.A. (California) Grad. Student, Princeton Univ., Princeton, N. J. 


*• Deceased, 



MEMBERS OP THE IKBTITTJTE 


649 


Walter, Asso. Prof. Robert M. A.M. (Columbia) Dept, of Math., N. J. College for Women 
(Rutgers Univ.), New Brunswick, N. J., S8 Lincoln Ave., Highland Park 
Warqjham, Ralph E. B.A. (Iowa) Managing Dir., Natl. Photocolor Corp , 306 E. 43rd 
St., N. y. 7, N. Y., Chappagua, Weatchesler County 
Watkins, Asso. Prof. John H. Ph.D. (Yale) Yale School of Medicine, New Haven, 
Conn., 318 Willow Si. 

Waugh, Frederick V. Ph.D. (Columbia) Agric. Economist, Office of War Mobil, and 
Recon., Wash., D. C., }008~i8 Si , S., Arlington, Va. 

Weaver, Chalmers L. B.S. (Kent State) Asst. Actuary, New Eng. Mutual Life Ins. 

Co., 601 Boylston St., Boston, Mass-, 104 Beaufort Ave., Needham 
Weber, Bruce T. M.A. (Columbia) Pnnoeton Univ., Princeton, N. J,, if Creecent 
Ave., Summit 

Weber, ComlUe J. Personal Trust Officer, The Chase Natl. Bank, 11 Broad St., N. Y., 
N. Y , P.O. Box 63, Chappaqua 

Week, Frank A. A.B, (Stanford) Metropolitan Life Ins. Co., 1 Madison Ave., N. Y. 
10, N. Y., Miller Rd , Datien, Conn. 

*Welda, Prof. Frank M. Ph.D. (Iowa) Head, Stat. Dept., The George Washington Univ., 
Wash. 6, D C., 7130 Hampden Lane, Bethesda 14, Md. 

Weiner, Louis A M. (Harvard) Stat., Bur, of Labor Stat., U. S. Dept, of Labor, Wash., 
D. C., 4il8 Russell Ave., Mt. Rainier, Md. 

Welngorten, Harry M.A. (Columbia) Tutor, College of the City of N. Y., and Instr., 
Academic Dept., N. Y. Univ., N. Y., 1330 Morris Ave,, Bronx 68 
Weinstein, Joseph M.S. (C.C N.Y.) Res. Analyst, Signal Corps Engr. Labs., Evans 
Signal Lab., Belmar, N, J., IS Washington Village, Asbury Park 
Weiss, Lionel M.A. (Columbia) Gottsberger Fellow, Columbia Univ., N. Y., N. Y., 
HOB Davidson Ave., N. Y. 63 

Weiss, Samuel M.A. (Michigan) Chief, Manpower Estimates Sec., War Manpower 
Comm., Wash., D. C., 3073 S. Buchanan, Arlington, Va. 

Welker, Asst. Prof. Everett L. Ph D. (Rlinois) Dept, of Math., Univ. of Ill., Urbana, 
Ill., lilS W. Charles St., Champaign 

Welsh, Charles A. PhD (New York) iOS N. Trenton St., Arlington, Va. 

Wescott, Asst. Prof. Mason E. Ph.D. (Northwestern) Dept, of Math., Northwestern 
Univ., Evanston, Ill., SOU Beechwood Dr., Wilmette 
Westman, Albert E. R. Ph.D. (Toronto) Dir. of Chem. Res., Ontario Res. Foundation, 
43 Queen’s Park, Toronto 6, Ont, Can., SB Olenayr Rd., Toronto 10 
Weyl, Eric (Cologne) Textile Consultant, Chicopee Mfg. Corp., Manchester, N. H., 
S79 Orange St 

Wherry, Prof. Robert J. Ph.D. (Ohio State) Dept, of Psychology, Univ. of N. O.; 
Chief, Res. and Analysis Sub. Section, Personnel Res. Sec., AGO, 270 Madison Ave., 
N. Y., N. Y. 

White, Prof, A. E. M.S. (Purdue) Kans. State Coll. Manhattan, Kane, 

White, Juliette M.S. (Iowa State) Sr. Fellow, Iowa State CoU., Ames, Iowa 
Whitney, D. Ransom M.A. (Princeton) Grad. Asst., Math. Dept., Ohio State Univ., 
Columbus, Ohio, 366 Crestview Rd., Columbus S 
Wilcox, Sidney W. L.B. (California) Chief Stat., Bur. of Labor Stat., Wash., 26, D. 0., 
P.O. Box 9SB, Ancon, Canal Zone 

WUcoxon, Frank Ph.D. (Cornell) Group Leader, Insecticide and Fungicide, La., Amer. 

Cyanamid Co., Stamford, Conn., E D. HI, Box S9a Riverside 
t*Wllks, Prof. Samuel S. Ph.D. (Iowa) Princeton Univ., Princeton, N. J., 1 Campbelton 
Circle 

WllUams, John D. B.S. (Arizona) S8S1 ISlh St., Santa Monica, Calif. (last address) 
Wilson, Elizabeth W. Ph.D. (Radcliffe) 1 Waterhouse, Cambridge 38, Mass. 

Wilson, Winfred P. M S. (Michigan) Grad. Student in Math., Univ. of Michigan, Ann 
Arbor, Mich., 116 N. State St., Apt. US 



660 


MEMBERS OF THE INSTITUTE 


'Wlnsor, Charles P. Ph.D. (Harvard) School of Hygiene and Pub. Health, Johns Hopkins 
TTniv., Baltimore, Md. 

Wlilck, Grover C., Jr. M.A. (Miahigan) Grad. Student, Univ. of Mioh., Ann. ^jbor, 
Mich., iiO N. Ingalls St. 

*'Wold, Prof. Herman O. Ph.D. (Stookholm) Univ. of Uppsala, Stat. Inst., Odinslund 
2, Uppsala, Sweden 

Wolfenden, Hugh H. PIA, FAS, FAIA, Consulting Actuary and Stat., P.O. Box 63, 
Postal Station K, 2384 Yonge St., Toronto 12, Ont., Can., 18i Jtomdale Heights Dr. 
Wolff, Marlon B.A. (Hunter) Asst. Math. Stat., Stat. Res. Group, Columbia Univ., 
N. Y., N. Y., m4 CroUma Park E., N. Y. 60 
♦Wolfowlta, Asso. Prof. Jacob. Ph.D. (Now York) Dept, of Math. Stat., Columbia 
Univ., N, y. 27, N. Y. 

Woodbury, Max A. M.S. (Michigan) Grad. Fellow, 3200 Angell Hall, Univ. of Mich., 
Ann Arbor, Mich. 

Working, Prof. Holbrook Ph.D. (Wisconsin) Econ , Food Res , Inst., Stanford Univ , 
Calif. 

Wright, C. Ashley M.A. (Princeton) Asso. Economist, Standard Oil Co. (N. J.), 30 
Rockefeller Plaza, N. Y. 20, N. Y., 35 Franklin Ave., Qroton-on-Hudson 
♦Wright, Prof. Sewall So.D. (Harvard) Dept, of Zoology, Univ. of Chicago, Chicago 37, 
ni., 6766 Harper Aiie. 

Wurtele, Zlvla S. M.A. (California) Asst in Math. Stat., Columbia Univ., N. Y., N. Y., 
106 Lexington Ave., N. Y. 18 

Wyckoff, John F. M.A. (Yale) Res. Div , Actuarial Dept., Connoctiout General Life 
Ins. Co., 65 Elm St., Hartford Conn., 78 W. Cesar SL, Newington 11 
Yntema, Prof. Theodore O. Ph.D. (Chicago) Dept, of Bus. and Econ. Policy, Univ. of 
Chicago, Chicago 37, Ill., 6769 Blackstone Ate 
Yood, Bertram M S (C^lif. Inst. Tech.) Grad. Student in Math., Yale Univ., New 
Haven,'Conn., 1648 Yale Station 

Yost, Earl K., Jr. B.S. (Washington and Jefferson) Grad. Asst., Univ. of Oregon, Eugene, 
Oregon 

Youden, William J. Ph.D. (Columbia) Pbys. Chemist, Boyce Thompson Inst, for 
Plant Res., 1086 N. Broadway, Yonkers3, N. Y., 440 Mile Square Rd., Yonkers 6 
Young, Cheng-Pong M.A. (George Washington) Mijor, Ord. Dept., Chinese Army, 
Chinese Supply Comm., 2311 Massachusetts Ave , Wash. 8, D. C 
Young, Louis M.S. (Mass. Inst. Tech.) Supv. of Qual. Control, Weatinghouse Elec. 

Corp., 663 Page Blvd., Springfield, Mass., 60 Cleveland St., Holyoke 
Zelger, Edward C. M.A, (Columbia) Guardian. Life Ins. Co. of America, 60 Union Sq., 
N.Y., N. y. 

Zuckennan, Samuel M.A. (Brooklyn) Math. U. B. Coast and Geodetic Survey, Federal 
Bldg., N. Y., N. Y., 649 E. 95th St., Brooklyn 16 
Zwerllng, Ruth M.A. (Columbia) 1780 Bryant Ave., Bronx 60, N. Y. 

Zwlnggl, Prof. Ernst Ph.D. (Berne) Subdir., Basle Life Ins. Co., Kapellenstr. 28, 
Basle, Switzerland 

GEOGRAPHICAL DISTRIBUTION OF MEMBERS* 

UNITED STATES 

Alabama (3) Oahfobnia (66) 

Aububn. S. Thompson, Abtbsia. Baldwin, 

Uni'vbbsitt. Gill, Lacey. Bbbkblbt. Beckstead, Dix, Eudey, Fix, 

* The Directory and Geographical Distribution are compiled from the addresses as re¬ 
ported by the members. In compiling the geographical distribution this year, the business 
address has been used in preference to the home or mail address. The home or mail address 
is used only in case the business address is not given. 



MEMBERS OF THE IKSTITUTB 


551 


Berkeley Cord'd). Gurland, Hodges, 
Hughes, Jarrett, Kuznela, Iiehmann, 
Massey, Nash, Rappaport, Shepard. 

Burbank. J. Crawford, Ghormloy. 

Davis. Baker, Rolfe. 

Emertvili.e. Bonnar, Eldredge. 

JIawtiiohne Howell. 

Hollywood. Jahn. 

Inyokern Odle, L. Thompson. 

La Jolla. McEwen. 

Long Beach. Mathieus. 

Los Anqeles Alchian, Alter, F. Camp¬ 
bell, C. Chang, Elconin, Guilford, Hool, 
Humm, Larson, Lefever, Miohael, 
Piper, Seheffe. 

Marb Island. Becker 

Oakland. Behrends, T. Simpson. 

Ontario. W. Lewis. 

Pacific Palisades. Steele. 

Pasadena. Nordquist, W. Rice, Rook, 

San Dieqo. Klauber. 

San Francisco. Brontonbrenner, Carlson, 
Oruden, Evensen, Gintzler, Gough, 
Olshen, O'Toole, R. Owen, Schilling, 
Sohug. 

Santa Monica. Maynard, Williams. 

Stanford University Bacon, E Grant, 
Ullman, Working. 

Colorado (6) 

Denver. E Crawford, Dyson. 

Fort Collins. Clark, Guard. 

Peterson Field. Studley 


Bresnahan, Brier, Burington, Burfc, 
Canter, Csplan, Chapman, Chaasan, 
Christopher, Cobb, Cornell, Cornfield, 
Curtiss, Daly, Doming, Dorfman, Dorn, 
Echogaray, W. Evans, Fadner, Fine, 
Flaherty, Girshiek, Graves, Greenhouse, 
Greenwood, Grevillc, Groves, Gurney, 
Hagood, Hammond, Hansen, Hanson, 
Hayden, Hendricks, Hess, Householder, 
Houseman, Hoy, Uumes, Hunvitz, I. 
Jackson, A. Kaitz, H. Kaitz, Kellogg, 
Knudsen, Konijn, Kopp, Kossaok, 
Kullbaok, Kury, Ladd, Lanoastor, 
Lange, Lesansky, Lieberman, Lieblein, 
Likert, Lindsey, Lyons, Marcuse, 
Marks, McGann, F. McIntyre, Mar¬ 
garet Moore, Marjorie Moore, Morrow, 
Mottley, Murray, Nisselson, H. Norton, 
K. Norton, Okun, Osborne, Ostennan, 
Perlo, Pope, Hapkin, J. Rice, Rosandor, 
Rosen, D. Rosenblatt, Sachs, Salkind, 
Snndomire, Sasuly, Schell, Shelton, 
Sheppard, Shulman, Walt Simmons, 
Willard Simmons, Skalak, J. H, Smith, 
R T. Smith, Stauber, J Steinberg, 
Stephens, Stock, Tepping, M. Tomlin¬ 
son, Treanor, Tuttle, Tweedy, Vandi- 
vere, Wadley, Wadman, Waksberg, 
Waugh, Weida, Weiner, S. Weiss, 
Wilcox, C. Young. 

(Florida (1) 

Gainesville. Germond. 


Connecticut (19) 


Georgia (6) 


Bridgeport. Ferris, Kirchen. Athens. Bancroft. 

Hartford. Dorweiler, Elston, Keffer, Sat- Atlanta. Jarvis Barnes, Pierce. 

terthwaite, Spoerl, Wyokoff. Rome. J. Reynolds. 

Middletown H. Arnold, Camp. Savannah. Elmore, Upholt. 

New Haven. Bliss, Fisher, Millikan, No¬ 
land, Ore, Stehn, Watkins, Yood. Illinois (50) 

Stamford. Wilcoxon. 

Argo. G. J. Cox. 


Delaware (3) 

Cannon. Cannon, 

Newark Rees, 

Wilmington. B. Lewis. 

District of Columbia (128) 
Washington. Aitchison, P. Anderson, 


Chicago. Bartky, Brooks, Cooper, Der¬ 
rick, M. Friedman, Gunlogson, Haa- 
velmo, Halmos, Halton, Jacobson, 
Jarmillo, H. Jones, Kaplansky, Klein, 
Koopmans, Leavens, Leipnik, Mans¬ 
field, Marschak, C. Martin, Nichols, 
H. Rubin, Schneberger, Silber, W. 
Simpson, J. Thompson, Thurstone, 


Beebe, Been, Beilinson, B. Benne^;, Wallis, 8. Wnght, Yntema 
Blackwell, Blake, Blanche, R. “BLAUtijS W. Jones 

Bloom, Boddie, Bonis, Brady, Bra n|t,.’!;,f^yf Ai .o\ ^ 




552 


MEMBERS OF THE INSTITUTE 


Elgin, Miller, Uric. 

Evanston. Hildebrandt, Ijonseth, Sza- 
trowakv, Trowbridge, Wcscott. 

Gbbat Lakes. Poston. 

Oak Park. Sprengel 
PocKFOED, M. Elvebaok. 

Pock Island. Cederberg. 

Urbana. Baer, Bower, Carter, Doob, 
M. Springer, Welker. 

Indiana (10) 

Bloomington. P. Gordon 
Fort Watne Hatko 
Gar?. Ede. 

Grbbncastlb. Greenleaf. 

Indianapolis. Borland, Hadley, Updiko. 
Lafatbttb. Burr, Field, JohRson. 

Iowa (15) 

Ames. G. W. Brown, Grump, D. Duncan, 
Federer, Hurwicz, A. TGng, Maloney, 
Mood, Snedecor, Thom, J. White. 

Des Moines. Grobh. 

Iowa Citt. Blommors, A. Craig, Knowler. 

Kansas (4) 

Lawrence. Price. 

Manhattan. Fryer, A. E, White. 

Wichita. Burke. 

Kbntuckt (3) 

Lexington. Goffman, South. 

Lotjisvillb. Stevenson. 

Louisiana (1) 

Barksdale Field. G. Simon. 

Maine (1) 

Orono Monro. 
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Rudmcki, Rule, Saidel, Salerno, 
Savage, Scherl, Sohietroma, vSchlorek, 
Schwartz, Schweitzer, Seth, Shannon, 
Siegeltuoh, L. G. Simon, F. E, Smith, 
Sobcl, Solomon, Spaney, Spiegelman, 

C. Stem, G Steinberg, Steinhaus, 
Sternnell, Stevens, Stewart, G. Stone, 
Stott, Sullivan, W. H. Thompson, 
Toralballa,MaryTorrey, Wald, Walker, 
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Nashville. Densen 
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